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Description 
Technical Field 

5 [0001] The present invention relates to a moving picture coding method and a decoding method, and particularly to 
a prediction coding method referring to plural coded pictures preceding in display order or plural coded pictures following 
in display order or plural pictures both preceding and following in display order. 

Background Art 

10 

[0002] Generally, information volume is compressed by reducing redundancy in temporal and spatial directions for 
moving picture coding. Therefore, motion deriving and motion compensation are performed on block-to-block basis 
referring to a preceding or a following picture, and a coding is performed for a difference value between an obtained 
predictive picture and a current picture for inter picture prediction coding aimed at reducing a temporal redundancy. 

15 [0003] In a moving picture coding method H.26L, which is currently under standardization, a picture with only intra 
picture prediction coding (I picture), a picture for which inter picture prediction coding is performed referring to one 
picture (hereinafter P picture) and a picture for which inter picture prediction coding is performed referring to two pictures 
preceding in display order or two pictures following in display order or each one of pictures preceding and following in 
display order (hereinafter B picture) are proposed. 

20 [0004] Fig. 1 is an illustration showing an example of a reference relation between each picture according to above- 
mentioned moving picture coding method and reference pictures. 

[0005] In picture 11 intra picture prediction coding is performed without a reference picture, and in picture P10 inter 
picture prediction coding is performed referring to a picture preceding in display order, P7. In a picture B6 inter picture 
prediction coding is performed referring to two pictures preceding in display order, in a picture B12 inter picture pre- 
25 diction coding is performed referring to two pictures following in display order and in a picture B1 8 inter picture prediction 
coding is performed referring to each one of pictures preceding and following in display order. 

[0006] A direct mode is one of prediction mode of bi-predictions which perform inter picture prediction coding referring 
to each of pictures preceding and following in display order. In the direct mode, motion vectors for a block to be coded 
are not coded in the bit stream directly, and two motion vectors for actual motion compensation are calculated referring 
30 to a motion vector of a co-located block in a coded picture close to the picture including the block to be coded in display 
order, and a predictive block is generated. 

[0007] Rg. 2 shows an example that a coded picture which is referred to in order to determine a motion vector in the 
direct mode contains a motion vector which refers to a preceding picture in display order. "P" indicated by a vertical 
line in Rg. 2 has nothing to do with a picture type and it shows a mere picture. In Rg. 2, for example, a picture P83, 

35 in which bi-prediction is performed referring to pictures P82 and P84, is a current picture to be coded. If it is assumed 
that a block with coding in the picture P83 is a block MB81 , a motion vector of the block MB81 is determined using a 
motion vector of a co-located block MBB2 in the picture P84 which is a coded backward reference picture. Since the 
block MB82 contains only one motion vector MV81 as a motion vector, two motion vectors MV82 and MV83 to be 
obtained are calculated directly by applying a scaling to a motion vector MV81 and a time interval TR81 based on 

40 Equation 1 (a) and Equation 1 (b). 

MV82 = MV81/TR81 x TR82 Equation 1 (a) 

45 

MV83 = -MV81/TR81 x TR83 Equation 1 (b) 

[0008] In these equations, the time interval TR81 shows an interval between the picture P84 and the picture P82, 
that is, a time interval between the picture P84 and a reference picture indicated by the motion vector MV81 . The time 
so interval TR82 shows a time interval between the picture P83 and a reference picture indicated by the motion vector 
MV82. The time interval TR83 shows a time interval between the picture P83 and a reference picture indicated by the 
motion vector MV83. 

[0009] The direct mode includes two methods, the temporal prediction already explained and the spatial prediction, 
and the spatial prediction is explained below. In the spatial prediction in the direct mode, for example, coding is per- 
55 formed on a macroblock of 16 x 16 pixels basis, and a motion vector, which is obtained referring to a picture closest 
from a current picture to be coded in display order, is selected from motion vectors in three macroblocks in the neighbor 
of the current macroblock to be coded, and the selected motion vector is a motion vector for the current macroblock 
to be coded. If three motion vectors refer to a same picture, a median value is selected. If two of three motion vectors 
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refer to a picture closest from a current picture to be coded in display order, the remainder is considered as "0" vector, 
and a median value of these values is selected. If only 1 motion vector refers to a picture closest from a current picture 
to be coded in display order, this motion vector is selected. Thus a motion vector is not coded for a current macroblock 
to be coded in the direct mode, and motion prediction is performed using a motion vector contained in other macroblock. 

5 [0010] Fig. 3A is an illustration showing an example of motion vector predicting method in the case that a picture 
preceding in a B picture in display order is referred to using a conventional spatial predicting method in the direct mode. 
In this Fig. 3A, P indicates a P picture, B indicates a B picture and numbers assigned to picture types in right four 
pictures indicate an order in which each picture is coded. It should be assumed that a macroblock diagonally shaded 
in a picture B4 is a current macroblock to be coded. When a motion vector of a current macroblock to be coded is 

10 calculated using a spatial predicting method in the direct mode, first, three coded macroblocks (area shaded with 
broken lines) are selected from the neighbor of the current macroblock to be coded. Explanation of a method for 
selecting three neighboring macroblocks is omitted here. Motion vectors in coded three macroblocks have been cal- 
culated and stored already. There is a case that the motion vector is obtained referring different pictures for each 
macroblock even if macroblocks are in a same picture. Reference indices in reference pictures used for coding each 

is macroblock can show which picture is referred to by the three neighboring macroblocks respectively from. Detail of 
reference indices will be explained later. 

[001 1] Now, for example, it is assumed that three neighboring macroblocks are selected for a current macroblock to 
be coded shown in Fig. 3A, and motion vectors in each coded macroblock are a motion vector a, b and c respectively. 
Here, it is assumed that the motion vector and the motion vector b are obtained referring to a P picture with a picture 

20 number 11 of "11 ", and the motion vector c is obtained referring to a P picture with the picture number 11 of "8 n . In this 
case, among these motion vectors, a, b, and c, the motion vectors a and b which refer to a picture closest to a current 
picture to be coded in order or display time are candidates for a motion vector of a current macroblock to be coded. In 
this case, the motion vector c is considered as "0", and a median value of these three motion vectors a, b and c is 
selected and determined as a motion vector of the current macroblock to be coded. 

25 [0012] However, a coding method such as MPEG-4 can perform coding for each macroblock in a picture using a 
field structure and a frame structure. Therefore, in a coding method such as MPEG-4, there is a case that a macroblock 
coded in the field structure and a macroblock coded in the frame structure are mixed in one frame of reference frame. 
Even in such a case, if three macroblocks in the neighbor of a current macroblock to be coded are coded in the same 
structure as the current macroblock to be coded, it is possible to derive a motion vector of the current macroblock to 

30 be coded using the above-mentioned spatial predicting method in the direct mode without any problems. That is, a 
case that three neighboring macroblocks are coded in the frame structure for a current macroblock to be coded in the 
frame structure, or a case that three neighboring macroblocks are coded in the field structure for a current macroblock 
to be coded in the field structure. The former case is as already explained. In the latter case, by using three motion 
vectors corresponding to top fields of three neighboring macroblocks for a top field of a current macroblock to be coded, 

35 and by using three motion vectors corresponding to bottom fields of three neighboring macroblocks for a bottom field 
of the current macroblock to be coded, a motion vector of the current macroblock to be coded can be derived for the 
top field and the bottom field respectively using the above-mentioned method. 

[0013] However, in the temporal prediction method in the direct mode, since the above-mentioned block contains 
plural motion vectors for temporal prediction in the direct mode when in a block with intra picture prediction coding, 

40 motion compensation in the direct mode is performed, if a block of which motion vector is referred to belongs to a B 
picture such as B6 shown in Fig. 1 , a problem that a calculation of motion vector by a scaling based on Equation 1 can 
not be applied directly occurs. Furthermore, there is a case that precision of motion vector value (half pixel precision 
and quarter pixel precision, for example) does not meet predetermined precision since dividing operation is performed 
after the calculation of motion vector. 

45 [0014] When a current macroblock to be coded and one of neighboring macroblocks are coded in a different structure 
for a spatial prediction, it is not specified which one of a field structure or a frame structure is used for coding the current 
macroblock to be coded, and a method for selecting a motion vector of the current macroblock to be coded from motion 
vectors of neighboring macroblocks coded in both the field structure and the frame structure is not specified. 
[001 5] The first object of the present invention is to offer a motion vector prediction method in temporal direction with 

50 high precision in the direct mode even if a block of which motion vector is referred to belongs to a B picture. 

[0016] The second object of the present invention is to offer a motion vector prediction method in spatial direction 
with high precision in the direct mode even if a block of which motion vector is referred to belongs to a B picture. 

Disclosure of Invention 

55 

[0017] In order to achieve above object, a motion vector calculation method according to the present invention is a 
motion vector calculation method for inter picture prediction with reference to a plurality of pictures. The motion vector 
calculation method comprises a referring step of referring to a plurality of pictures preceding in display order or a 
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plurality of pictures following in display order or a plurality of pictures both preceding and following in display order, 
and a motion compensating step, with reference to a motion vector of a co-located block in a picture other than a 
picture to which an inter picture predictive block belongs when performing motion compensation of the inter picture 
predictive block, for calculating a motion vector of the inter picture predictive block using at least one motion vector, 
5 which satisfies a predetermined condition, among motion vectors already calculated for the co-located block of which 
motion vector is referred to. Therefore according to the motion vector calculation method of the present invention, when 
in a block with inter picture predictive coding motion compensation is performed referring to a motion vector of a co- 
located block in a coded other picture, and when a block of which motion vector is referred to contains a plurality of 
motion vectors, it is possible to actualize the motion compensation without contradiction by generating one motion 

10 vector used for scaling among the plurality of motion vectors. 

[0018] For the motion vector calculation method according to the present invention, in the reference step, each one 
of pictures selected from a first picture order and a second picture order can be referred to. Here the first picture order 
is the order in which identifying numbers are assigned to pictures in ascending sequence giving priority to a picture 
precedent in display order, and the second picture order is the order in which identifying numbers are assigned to 

15 pictures in ascending sequence giving priority to a picture following in display order. In the motion compensating step, 
a motion vector, which refers to a picture in the first picture order, of the block of which motion vector is referred to may 
be used. For the method, even if a block of which motion vector is referred to belongs to the B picture, the one used 
for the motion compensation of the block with inter picture prediction can be determined as a motion vector referring 
to a picture in the first picture order, and motion vector calculation method by scaling can be applied. 

20 [0019] Moreover, another motion vector calculation method according to the present invention includes an assigning 
step, a first selecting step and a deriving step. The assigning step is for assigning one of a first reference index or a 
second reference index to the coded picture. Here, the first reference index and the second reference index are used 
for selecting at lest one of a first reference picture or a second reference picture which are referred to when obtaining 
a block in a current picture to be coded by motion compensation from a plurality of coded pictures stored in a storing 

25 unit. The first selecting step is for selecting a motion vector indicating a median value of the motion vectors, when 
performing motion compensation of a block in the current picture to be coded, there are a plurality of motion vectors 
containing the first reference indices among motion vectors of blocks in the neighbor of the current picture to be coded. 
The deriving step is for deriving a motion vector referring to a picture preceding the current picture to be coded in 
display order or a picture following the current picture to be coded in display order or pictures preceding and following 

30 the current picture to be coded in display order using the motion vector selected in the first selecting step. Therefore, 
when motion compensation is performed for a block in the current picture to be coded and when there are a plurality 
of motion vectors containing the first reference index among motion vectors of blocks in the neighbor of the block in 
the current picture to be coded, the motion vector of the current picture to be coded can be derived using a motion 
vector indicating a median value of motion vectors. 

35 [0020] For the motion vector calculation method according to the present invention, a motion vector indicating a 
median value of the smallest first reference indices may be further selected from motion vectors containing the first 
reference indices in the first selecting step. 

[0021] Further Information about Technical Background to this Application 

40 Japanese Patent Application No. 2002 - 118598 filed April 19, 2002: 

Japanese Patent Application No. 2002 - 121053 filed April 23, 2002: 

Japanese Patent Application No. 2002 - 1 56266 filed May 29, 2002: 

Japanese Patent Application No. 2002 - 1 77889 filed June 1 9, 2002: 

Japanese Patent Application No. 2002 - 193027 filed July 2, 2002: 
45 Japanese Patent Application No. 2002 - 204713 filed July 12, 2002: 

Japanese Patent Application No. 2002 - 262151 filed September 6, 2002: 

Japanese Patent Application No. 2002 - 290542 filed October 2, 2002: 

Japanese Patent Application No. 2002 - 323096 filed November 6, 2002: 

Japanese Patent Application No. 60 / 378643 filed May 9, 2002: 
50 Japanese Patent Application No. 60 / 378954 filed May 1 0, 2002: 

, are incorporated herein by reference. 

Brief Description of Drawings 

55 

[0022] 

Fig. 1 is a schematic diagram showing a referential relation of pictures of a conventional example. 
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Fig. 2 is a schematic diagram showing an operation in a conventional direct mode. 

Fig. 3A is an illustration showing an example of a motion vector predicting method when a temporally preceding 
picture is referred to in a B picture using a spatial predicting method of a conventional direct mode. 
Fig. 3B is an illustration showing an example of a reference list generated in each current picture to be coded. 
5 Fig. 4 is an explanatory illustration of picture numbers and reference indices. 

Fig. 5 is an illustration showing a concept of a picture coding signal format of a conventional picture coding appa- 
ratus. 

Fig. 6 is a block diagram showing an operation of coding according to the first and the second embodiments of 
this invention. 

10 Fig. 7 is a schematic diagram showing an operation when a block of which motion vector is referred to in the direct 

mode contains two motion vectors which refer to preceding time in display order. 

Fig. 8 is a schematic diagram comparing a referential relation of pictures in display order and coding order. 
Fig. 9 is a schematic diagram showing an operation when a block of which motion vector is referred to in the direct 
mode contains two motion vectors which refer to following time in display order. 
'5 Rg. 1 0 is a schematic diagram comparing a referential relation of pictures in the display order and the coding order. 

Fig. 11 is a block diagram showing an operation of decoding according to the fifth and sixth embodiments of the 
present invention. 

Fig. 12 is a schematic diagram showing an operation when a block of which motion vector is referred to in the 
direct mode contains two motion vectors which refer to preceding time in display order. 
20 Rg. 1 3 is a schematic diagram to show an operation when a block of which motion vector is referred to in the direct 

mode contains two motion vectors which refer to following time in display order. 

Fig. 1 4 is a schematic diagram to show an operation when a block of which motion vector is referred to in the direct 
mode contains two motion vectors which refer to following time in display order. 

Fig. 15 is a schematic diagram to show an operation when a block of which motion vector is referred to in the direct 
25 mode contains two motion vectors which refer to preceding time in display order. 

Fig. 1 6 is a schematic diagram to show an operation when a block of which motion vector is referred to in the direct 
mode contains two motion vectors which refer to preceding time in display order. 

Fig. 1 7 is a schematic diagram to show an operation when a block of which motion vector is referred to in the direct 
mode contains two motion vectors which refer to preceding time in display order. 
30 Fig. 1 8 is a schematic diagram to show an operation when a block of which motion vector is referred to in the direct 

mode contains two motion vectors which refer to preceding time in display order. 

Fig. 1 9 is a schematic diagram to show an operation when a block of which motion vector is referred to in the direct 
mode contains two motion vectors which refer to following time in display order. 

Fig. 20 is a schematic diagram to show an operation when a block of which motion vector is referred to in the direct 
35 mode contains two motion vectors which refer to following time in display order. 

Fig. 21 is a schematic diagram to show an operation when a block of which motion vector is referred to in the direct 
mode contains two motion vectors which refer to following time in display order. 

Fig. 22 is a schematic diagram to show an operation when a block of which motion vector is referred to in the direct 
mode contains two motion vectors which refer to following time in display order. 
40 Rg. 23 is a schematic diagram to show an operation when a motion vector of a neighboring block is referred to in 

the direct mode. 

Fig. 24 is an illustration showing a bit stream. 

Fig. 25 is an illustration showing a relation between a current block to be coded and a block in the neighbor of the 
current block to be coded. 

45 Fig. 26 is an illustration showing a motion vector contained in a block in the neighbor of a current block to be coded. 

Fig. 27 is an illustration showing a motion vector contained in a block in the neighbor of a current block to be coded. 

Fig. 28 is an illustration showing a motion vector contained in a block in the neighbor of a current block to be coded. 

Fig. 29 is an illustration showing a motion vector contained in a block in the neighbor of a current block to be coded. 

Fig. 30 is an illustration showing a motion vector contained in a block in the neighbor of a current block to be coded. 
so Fig. 31 is an illustration showing a motion vector contained in a block in the neighbor of a current block to be coded. 

Fig. 32 is an illustration showing a motion vector contained in a block in the neighbor of a current block to be coded. 

Fig. 33 is an illustration showing a motion vector contained in a block in the neighbor of a current block to be coded. 

Fig. 34 is an illustration showing a procedure for determining a motion vector to be used in the direct mode. 

Fig. 35 is an illustration showing a relation between a current block to be coded and a block in the neighbor of the 
55 current block to be coded. 

Fig. 36 is an illustration showing a procedure for determining a motion vector of a current block to be coded using 

a value of a reference index. 

Fig. 37 is an illustration showing bi-prediction in the direct mode when a motion vector referring to a picture stored 
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in a long term picture buffer is only one. 

Fig. 38 is an illustration showing bi-prediction in the direct mode when motion vectors referring to a picture stored 
in the long term picture buffer are two. 

Fig. 39 is an illustration showing a process flow of a motion vector calculation method. 
5 Fig. 40 is a block diagram showing a configuration of a moving picture coding apparatus 100 according to the 

eleventh embodiment of the present invention. 

Fig. 41 A is an illustration showing an order of frames inputted into the moving picture coding apparatus 100 in 
order of time on picture-to-picture basis. 

Fig. 41 B is an illustration showing the case that the order of frames shown in Fig. 41 A is reordered in the coding 
10 order. 

Fig. 42 is an illustration showing a structure of a reference picture list to explain the first embodiment 
Fig. 43A is a flow chart showing an example of a motion vector calculation procedure using a spatial predicting 
method in the direct mode when a macroblock pair to be coded in a field structure and a macroblock pair to be 
coded in a frame structure are mixed. 

is Fig. 43B is an illustration showing an example of a location of neighboring macroblock pairs to which the present 

invention is applied when a current macroblock pair to be coded is coded in a frame structure. 
Fig. 43C is an illustration showing an example of location of neighboring macroblock pairs to which the present 
invention is applied when a current macroblock pair to be coded is coded in a field structure. 
Fig. 44 is an illustration showing a data configuration of a macroblock pair when coding is performed in a frame 

20 structure, and a data configuration of a macroblock pair when coding is performed in a field structure. 

Fig. 45 is a flow chart showing a detailed processing procedure in a step S302 shown in Fig. 43. 
Fig. 46 is an indicator chart showing a relation between reference field indices and reference frame indices. 
Fig. 47 is a flow chart showing a detailed processing procedure in a step S303 shown in Fig. 43. 
Fig. 48 is an illustration showing a relation of position between a current macroblock pair to be coded and neigh- 

25 boring macroblock pairs in order to explain the first embodiment. 

Fig. 49 is an illustration showing a positional relation between a current macroblock pair to be coded and neigh- 
boring macroblock pairs in order to explain the first embodiment. 

Fig. 50 is an illustration showing an example of a data configuration of a bit stream 700 generated by a bit stream 
generating unit 1 04. 

30 Fig. 51 is a block diagram showing a configuration of a moving picture decoding apparatus BOO which decodes 

the bit stream 700 shown in Fig. 50. 

Fig. 52A is an illustration showing an example of a physical format of a flexible disk which is a body of a storage 
medium. 

Fig. 52B is an illustration showing an external view of the flexible disk viewed from the front, a configuration of the 
35 section and the flexible disk. 

Fig. 52C is an illustration showing a configuration to record and read the above-mentioned program on a flexible 
disk, FD. 

Fig. 53 is a block diagram showing an entire configuration of contents supply system implementing a contents 
delivery service. 

40 Fig. 54 is an illustration showing an example of an appearance of a cell phone. 

Fig. 55 is a block diagram showing a configuration of the cell phone. 

Fig. 56 is an illustration to show a device performing the coding or the decoding process shown in above embod- 
iments and a system using the device. 

45 Best Mode for Carrying Out the Invention 

[0023] The present invention is to solve problems of the conventional technology, and aims at proposing a moving 
picture coding method and a decoding method which can determine a motion vector used for motion compensation 
without contradiction even if a block of which motion vector is referred to in a direct mode is a B picture. First, reference 

so indices are explained here. 

[0024] Fig. 3B is an illustration showing an example of a reference picture list 1 0 generated for each current picture 
to be coded. In the reference picture list 10 shown in Fig. 3B, pictures are shown preceding and following a B picture 
in display order with one B picture at the center, and pictures to which the B picture can refer, picture types, a picture 
number 11, the first reference index 12 and the second reference index 13 are shown. The picture number 11 is, for 

55 example, a number showing an order in which each picture is coded. The first reference index 12 is the first index 
showing a relative positional relation between a current picture to be coded and neighboring pictures, and, for example, 
is used mainly as an index when a current picture to be coded refers to a picture preceding in display order. A list of 
the first reference index 12 is called a "reference index listO (listO)" or "the first reference index list M . Moreover, the 
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reference index is called a relative index. First, in the reference picture list 1 0 shown in Fig. 3B, integer which is advanced 
by "1 " is assigned to a value of the first reference index 12 from "0" from the closest to a current picture to be coded 
in a time sequence for a reference picture preceding a current picture to be coded in display order. Next, after a value 
advanced by "1" from "0" is assigned to all reference pictures following a current picture to be coded in display order, 
5 following values are assigned to reference pictures following the current picture to be coded in display order from 
closest to the current picture to be coded in display order. 

[0025] The second reference index 13 is the second index showing a relative positional relation between a current 
picture to be coded and neighboring pictures, and, for example, is used mainly as an index when a current picture to 
be coded refers to a picture following in display order. A list of the second reference index 1 3 is called "reference index 

10 listl (Iist1) w or "the second reference index list". First, integer which is advanced by M 1" is assigned to a value of the 
second reference index 13 is from "0" from the closest to a current picture to be coded in display order. Next, after a 
value advanced by "1" from "0" is assigned to all reference pictures following a current picture to be coded in display 
order, following values are assigned to reference pictures preceding a current picture to be coded in display order from 
the closest value to a current picture to be coded in display order. Therefore, it is found in the reference picture list 10 

is that as for the first reference index 1 2 and the second reference index, a reference picture with smaller reference index 
value is closer to the current picture to be coded in display order. A method for assigning a reference index number in 
initial state of is explained above, however, the method for assigning a reference index number can be changed on a 
picture-by-picture basis or a slice-by-slice basis. In the method for assigning a reference index number is, for example, 
a small number may be assigned to a picture far in display order, however, such a reference index is used, for example, 

20 when coding efficiency is improved by referring to the picture far in display order. In other words, since reference indices 
in a block are presented by variable length code words and data with shorter lengths are assigned to the indices of 
the smaller values, by assigning smaller reference index to the reference picture which improves coding efficiency if 
it is referred to, the amount of codes in reference indices is reduced and further coding efficiency is improved. 
[0026] Fig. 4 is an explanatory illustration for picture numbers and reference indices. Fig. 4 shows an example of 

25 the reference picture list, and shows a reference picture, a picture number and a reference index used when coding 
the B picture at the center (indicated by a broken line). Fig. 4A shows the case assigning reference indices by the 
method for assigning reference indices in initial state explained using Fig. 3. 

[0027] Fig. 5 is a conceptual diagram of a picture coding signal format of a conventional picture coding apparatus. 
Picture indicates a coding signal for one picture, Header indicates a header coding signal included in the head of a 

30 picture, Blockl indicates a coding signal in a block coded in a direct mode, Block2 indicates a coding signal in a block 
coded by an interpolation prediction other than the direct mode, RidxO and Ridxl are the first reference index and the 
second reference index respectively, and MVO and MV1 are the first motion vector and the second motion vector 
respectively. The coded block Block2 has two reference indices RidxO and Ridxl in a coding signal in this order for 
indicating two reference pictures to be used for motion compensation. Moreover, the first motion vector MV1 and the 

35 second motion vector MV2 are coded in the coding signal of the coded block Block2 in this order. It can be judged by 
PredType that which of the reference indices RidxO and/or Ridxl is used. A picture (the first reference picture) referred 
to by the first motion vector MVO is indicated by the first reference index RidxO, and a picture (the second reference 
picture) referred to by the second motion vector MV1 is indicated by the second reference index Ridxl . For example, 
when it is indicated that pictures are referred to bi-directionally by the motion vectors MVO and MV1 , RidxO and Ridxl 

40 are used, when it is indicated that pictures are referred to uni-di recti on ally by one of motion vector MVO or MV1 , one 
of RidxO or Ridxl corresponding to the motion vector is used, and when the direct mode is indicated, neither RidxO 
nor Ridxl are used. The first reference picture is specified by the first reference index and generally has display time 
preceding a current picture to be coded, and the second reference picture is specified by the second reference index 
and generally has display time following the current picture to be coded. However, as the method for assigning reference 

45 indices in Fig. 4 shows, there is a case that the first reference picture contains display time following the current picture 
to be coded and the second reference picture contains display time preceding the current picture to be coded. The 
first reference index RidxO is a reference index indicating the first reference picture referred to by the first motion vector 
MVO of the block Block2, and the second reference index Ridxl is a reference index indicating the second reference 
picture referred to by the second motion vector MV1 of the block Biock2. 

50 [0028] On the other hand, an assignment of reference pictures to reference indices can be changed arbitrarily by 
indicating explicitly using a memory control signal in a coded signal (RPSL in Header in Fig. 5). This makes it possible 
to change the reference picture with the second reference index "0" to an arbitrary reference picture. For example, as 
shown in Fig. 4B, assignment of reference indices to picture numbers can be changed. 

[0029] Thus, since assignment of reference picture to reference indices can be changed arbitrarily and the change 
55 of the assignment of reference pictures to reference indices generally assigns a smaller reference index to a picture 
which improves coding efficiency if selected as a reference picture, coding efficiency can be improved by using a motion 
vector, which refers to a picture of which reference index is the smallest, as a motion vector used in the direct mode. 
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(First Embodiment) 

[0030] A moving picture coding method according to the first embodiment of the present invention is explained using 
the block diagram shown in Fig. 6. 

5 [0031] A current moving picture to be coded is inputted into a frame memory 101 in a display order on a picture-to- 
picture basis, and reordered in a coding order. Each picture is divided into a group called a block, which is 1 6 (horizontal) 
x 16 (vertical) pixels in size, for example, and following processes are performed on a block-to-block basis. 
[0032] A block read from the frame memory 101 is inputted into a motion vector detecting unit 106. Here, a motion 
vector of a current block to be coded is detected using a decoded picture of a coded picture stored in the frame memory 

10 1 05 as a reference picture. In this case, in a mode selecting unit 1 07, an optimum prediction mode is determined with 
reference to a motion vector obtained in the motion vector detecting unit 106 and a motion vector used in a coded 
picture stored in a motion vector storing unit 108. A prediction mode obtained in the mode selecting unit 107 and a 
motion vector used in the obtained mode are inputted to a difference calculating unit 109, and a predictive residual 
picture is generated by calculating a difference from a current block to be coded, and coding is performed in a predictive 

15 residual coding unit 102. Moreover, the motion vector used in the mode obtained in the mode selecting unit 107 is 
stored in a motion vector storing unit 108 in order to be used for coding by'following blocks and pictures. An above 
processing flow is an operation when an inter picture prediction coding is selected, however, a switch 111 switches to 
an intra picture prediction coding. Eventually, variable length coding is performed for control information, such as a 
motion vector, and picture information, such as picture information outputted from the predictive residual coding unit 

20 1 02, and a bit stream outputted eventually is generated by a bit stream generating unit 1 03. 

[0033] A summary of coding flow is explained above, however, detail of process in the motion vector detecting unit 
1 06 and the mode selecting unit 107 is explained below. 

[0034] Motion vector detecting is performed on a block-by-block basis or an area-by-area (area is a divided block) 
basis. Using coded pictures preceding and following a current picture to be coded in display order as reference pictures, 
25 a predictive picture and a prediction mode showing a location which is predicted optimum in the search area in the 
picture is generated by deciding a motion vector. 

[0035] A direct mode is one of bi-predictions which perform inter picture prediction coding prediction referring to two 
pictures preceding and/or following in display order. In the direct mode, a current block to be coded does not contain 
a motion vector directly, and two motion vectors for actual motion compensation are calculated referring to a motion 

30 vector of a co- located block in a coded picture close in display order, and a predictive block is generated. 

[0036] Fig. 7 shows an operation when a coded block referred to in order to determine a motion vector in the direct 
mode contains two motion vectors which refer to two pictures preceding in display order. A picture P23 is a current 
picture to be coded, and performs bi-prediction referring to pictures P22 and P24. Assume that a block to be coded is 
a block MB21 ; and two required motion vectors are determined using a motion vector contained in a block MB22, which 

35 is a co-located block in the coded following reference picture (the second reference picture specified by the second 
reference index) P24. Since the block MB22 contains motion vectors MV21 and MV22 as a motion vector, it is impossible 
to calculate two required motion vectors MV23 and MV24 by scaling directly similarly to Equation 1 . Therefore, similarly 
to Equation 2 shows, a motion vector MV_REF is calculated as a motion vector to be scaled from an average value 
of two motion vectors contained in the block MB22, and a time interval TR_REF at that time is calculated from the 

*o average value likewise. Then, motion vectors MV23 and MV24 are calculated by scaling the motion vector MV_REF 
and the time interval TR_REF based on Equation 3. In this case, the time interval TR21 indicates a time interval between 
the picture P24 and the picture P21 , that is, a picture referred to by the motion vector MV21 , and the time interval TR22 
indicates a time interval until a picture referred to by the motion vector MV22. Moreover, the time interval TR23 is a 
time interval until a picture referred to by the motion vector MV23, and the time interval TR24 is a time interval until a 

45 picture referred to by the motion vector MV24. Time intervals between these pictures can be determined based on, for 
example, information indicating display time and display order added to each picture or difference of information. Note 
that a current picture to be coded refers to a next picture in the example of Fig. 7, however, the case referring to a 
picture which is not next may be treated in the same manner. 



50 



MV_REF= (MV21 + MV22)/2 



Equation 2 (a) 



TR_REF = (TR21 + TR22)/2 



Equation 2 (b) 



55 



MV23 = MV_REF/TR_REF x TR23 



Equation 3 (a) 
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MV24 = -MV_REF/TR_REF x TR24 Equation 3 (b) 

[0037] The above embodiment shows the coding method in which an inter picture prediction coding can be performed 
5 using the direct mode without contradiction even if a block of which motion vector is referred to in the direct mode 
belongs to a B picture. In the coding method, when a block of which motion vector is referred to in the direct mode 
contains plural motion vectors which refer to a picture preceding in display order, one motion vector is generated using 
the plural motion vectors, and two motion vectors to be used for actual motion compensation are determined by scaling. 
Note that when two motion vectors MV23 and MV24 in Fig. 7 are calculated, it is possible to use Equation 4 instead 
10 of Equation 2 as a method for averaging motion vectors MV21 and MV22, an for averaging time intervals TR21 and 
TR22 in order to calculate the motion vector MV_REF and the time interval TR_REF to be scaled. 
[0038] First, as Equation 4 (a) shows; the motion vector MV2V is calculated by scaling MV21 to equate the time 
interval with the motion vector MV22. Then the motion vector MV_REF is determined by averaging motion vectors 
MV21 ' and MV22. Here, the time interval TR22 is used directly as the time interval TR_RF. Note that the case calculating 
15 a motion vector MV22' by scaling the motion vector MV22 instead of calculating the motion vector MV21 ' by scaling a 
motion vector MV21 may be treated in the same manner. 



20 



25 



35 



40 



45 



MV21 ' = MV21/TR21 x TR22 Equation 4 (a) 

MV_RF = (MV21 ' + MV22)/2 Equation 4 (b) 

TR_REF = TR22 Equation 4 (c) 

[0039] Note that when two motion vectors MV23 and MV24 in Fig. 7 are calculated, as a motion vector MV_REF 
and a time interval TR_REF which are scaled, a motion vector MV22 and a time interval TR22, which refer to a picture 
P22 located temporally closer to a picture P24 of which motion vector is referred to, can be directly used as Equation 
5 shows instead of using an average value of two motion vectors as Equation 2 shows. Likewise, as a motion vector 
MV_REF and a time interval TR_REF, a motion vector MV21 and a time interval TR21 , which refer to a picture P21 
located temporally farther, can be directly used as Equation 6 shows. This makes it possible to reduce capacity of a 
motion vector storing unit in a coding apparatus since each block belonging to a picture P24 of which motion vector is 
referred to can perform motion compensation by storing only one of two motion vectors. 

MV_REF = MV22 Equation 5 (a) 

TR_REF = TR22 Equation 5 (b) 

MV_REF =MV21 Equation 6 (a) 

TR_REF = TR21 Equation 6 (b) 



[0040] Note that when two motion vectors MV23 and MV24 in Fig. 7 are calculated, as a motion vector MV_REF 
and a time interval TR_REF which are scaled, a motion vector which refers to a picture to be coded precedently can 

so be directly used instead of using an average value of two motion vectors as Equation 2 shows. Fig. 8A shows a reference 
relation in display order of moving pictures as Fig. 7 shows, and Fig. 8B shows an example of an order in which pictures 
are reordered by coding order in the frame memory 101 shown in Fig. 6. Here, a picture P23 indicates a picture to be 
coded in the direct mode, and a picture P24 indicates a picture of which motion vector is referred to for the coding. 
When pictures are reordered as shown in Fig. 8B, since a motion vector which refers to a picture to be coded precedently 

55 is directly used, a motion vector MV22 and a time interval TR22 are directly used as a motion vector MV_REF and a 
time interval TR_REF as shown in Equation 5. Likewise, it is possible to directly use a motion vector which refers to a 
picture to be coded later. In this case, a motion vector MV21 and a time interval TR21 are directly applied as a motion 
vector MV_REF and a time interval TR_REF as Equation 6 shows. This make it possible to reduce capacity of a motion 
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vector storing unit in a coding apparatus since each block belonging to a picture P24 of which motion vector is referred 
to can perform motion compensation by storing only one of two motion vectors. 

[0041] Note that in this embodiment, the case that a motion vector used in the direct mode is calculated by scaling 
a referenced motion vector using a time interval between pictures is explained, however, the motion vector may be 
5 calculated by multiplying by a constant number. Here, a constant used for the multiplication may be variable when 
coding or decoding is performed on plural blocks basis or on plural pictures basis. 

[0042] Note that in Equation 2 (a) or 4 (b), when a motion vector MV_REF is calculated, after calculating the right 
side of Equation 2 (a) or 4 (b), the motion vector may be rounded to a predetermined motion vector precision (for 
example, round to a value of 0.5 pixel unit for a motion vector with half pixel precision). Precision of a motion vector is 
10 not limited to half pixel precision. In addition, precision of a motion vector can be determined on block basis, picture 
basis, and sequence basis, for example. Moreover, in Equations, 3 (a), 3 (b) and 4 (a), when motion vectors MV23, 
MV24 and MV21 ' are calculated, motion vectors may be rounded to a predetermined precision of a motion vector after 
calculating the right side of Equations 3 (a), 3 (b) and 4 (a). 

is (Second Embodiment) 

[0043] An overview of coding process based on Fig. 6 is completely equal to the first embodiment. Here, a detailed 
operation of bi-prediction in the direct mode is explained using Fig. 9. 

[0044] Fig. 9 shows an operation when a block referred to in order to determine a motion vector in the direct mode 

20 contains two motion vectors which refer to two following pictures in display order. A picture P43 is a current picture to 
be coded, and performs bi-prediction referring to pictures P42 and P44. Assume that a block to be coded is a block 
MB41 , then two required motion vectors are determined using a motion vector of a co-located block MB42 in the coded 
backward reference picture (the second reference picture specified by the second reference index) P44. Since the 
block MB42 contains two motion vectors MV45 and MV46 as motion vectors, two required motion vectors MV43 and 

25 MV44 cannot be calculated by applying directly scaling similarly to Equation 1. Therefore, as Equation 7 shows, a 
motion vector MV_REF is determined as a motion vector to be scaled from an average value of two motion vectors of 
the block MB42, and a time interval TR_REF at that time is determined from an average value likewise. Then motion 
vectors MV43 and MV44 are calculated by scaling a motion vector MV_REF and a time interval TR_REF based on 
Equation 8. In this case, a time interval TR45 indicates a time interval between a picture P44 and P45, that is, until a 

30 picture which is referred to by a motion vector MV45; and a time interval TR46 indicates a time interval until a picture 
which is referred to by a motion vector MV46. A time interval TR43 indicates a time interval until a picture which is 
referred to by a motion vector MV43; and a time interval TR44 indicates a time interval until a picture which is referred 
to by a motion vector MV44. Time intervals between these pictures can be determined based on, for example, infor- 
mation indicating display time and display order that is added to each picture or difference of information as explained 

35 in the first embodiment. Note that a current picture to be coded refers to a next picture in the example of Fig. 9, however, 
the case referring to a picture which is not next may be treated in the same manner. 



40 



45 



MV_REF = (MV45 + MV46)/2 Equation 7 (a) 

TR_REF =r (TR45 + TR46)/2 Equation 7 (b) 

MV43 = -MV_REF/TR_REF x TR43 Equation 8 (a) 

MV44 = M V_R E F/TR_R E F x TR44 Equation 8 (b) 

so [0045] The above embodiment shows the coding method in which an inter picture prediction coding can be performed 
using the direct mode without contradiction even if a block of which motion vector is referred to in the direct mode 
belongs to a B picture. In the coding method, when a block of which motion vector is referred to in the direct mode 
contains plural motion vectors which refer to a following picture in display order, a motion vector is generated using 
the plural motion vectors, and two motion vectors to be used for actual motion compensation are determined by scaling. 

55 [0046] Note that when two motion vectors MV43 and MV44 in Fig. 9 are calculated, it is possible to use Equation 9 
instead of Equation 7 as a method for averaging motion vectors MV45 and MV46 and for averaging time intervals TR45 
and TR46 in order to calculate the motion vector MV_REF and the time interval TR_REF to be scaled. First, as Equation 
9 (a) shows, the motion vector MV46' is calculated by scaling MV46 to equate the time interval with the motion vector 
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MV45. Then the motion vector MV_REF is determined by averaging motion vectors MV46' and MV45. Here, the time 
interval TR41 is used directly as the time interval TR_REF. Note that the case calculating a motion vector MV45' by 
scaling the motion vector MV45 instead of calculating the motion vector MV46' by scaling a motion vector MV46 may 
be treated in the same manner. 

MV46' = MV46/TR46 x TR45 Equation 9 (a) 

MV_REF = (MV46* + MV45)/2 Equation 9 (b) 

TR_REF = TR45 Equation 9 (c) 

15 [0047] Note that when two motion vectors MV43 and MV44 in Fig. 9 are calculated, as a motion vector MV_REF 
and a time interval TR_REF which are scaled, a motion vector MV45 and a time interval TR45, which refer to a picture 
P45 located temporally closer to a picture P44 of which motion vector is referred to, can be directly used as Equation 
1 0 shows instead of using an average value of two motion vectors as Equation 7 shows. Likewise, as a motion vector 
MV_REF and a time interval TR_REF, a motion vector MV46 and a time interval TR46, which refer to a picture P46 

20 located temporally farther can be directly used as Equation 1 1 shows. This method makes it possible to reduce capacity 
of a motion vector storing unit in a coding apparatus since each block belonging to a picture P44 of which motion vector 
is referred to can implement motion compensation by storing only one of two motion vectors. 



25 



30 



35 



MV_REF = MV45 Equation 1 0 (a) 

TR_REF = TR45 Equation 1 0 (b) 

MV_REF = MV46 Equation 1 1 (a) 

TR_REF = TR46 Equation 1 1 (b) 



[0048] Note that when two motion vectors MV43 and MV44 in Fig. 9 are calculated, as a motion vector MV_REF 
and a time interval TR_REF which are scaled, a motion vector which refers to a picture to be coded precedently can 
be directly used instead of using an average value of two motion vectors as Equation 7 shows. Fig. 10A shows a 
referential relation of pictures in display order of moving pictures as Fig. 9 shows, and Rg. 10B shows an example of 

40 an order in which pictures are reordered in coding order in the frame memory 1 01 shown in Rg. 6. Here, a picture P43 
indicates a picture to be coded in the direct mode, and a picture P44 indicates a picture of which motion vector is 
referred to for the coding. When pictures are reordered as shown in Rg. 10B, since a motion vector which refers to a 
picture to be coded precedently is directly used, a motion vector MV46 and a time interval TR46 are directly used as 
a motion vector MV_REF and a time interval TR_REF as shown in Equation 1 1 . Likewise, it is possible to directly use 

45 a motion vector which refers to a picture to be coded later, in this case, a motion vector MV45 and a time interval TR45 
are directly applied as a motion vector MV_REF and a time interval TR_REF. This method makes it possible to reduce 
capacity of a motion vector storing unit in a coding apparatus since each block belonging to a picture P44 of which 
motion vector is referred to can perform motion compensation by storing only one of two motion vectors. 
[0049] Note that when a picture which is referred to in order to determine a motion vector in the direct mode contains 

so two motion vectors which refer to two following pictures in display order, it is possible to. perform motion compensation 
assuming that two required motion vectors MV43 and MV44 are "0". This method makes it possible to reduce capacity 
of a motion vector storing unit in a decoding apparatus, and further makes it possible to omit a process of calculating 
a motion vector, since each block belonging to a picture P44 of which motion vector is referred to does not have to 
store a motion vector. 

55 [0050] Note that when a picture which is referred to in order to determine a motion vector in the direct mode contains 
two motion vectors which refer to two following pictures in display order, it is possible to inhibit referring to a motion 
vector and to apply only a prediction coding other than the direct mode. When following two pictures in display order 
are referred to as a picture P44 shown in Rg. 9, it is conceivable that correlation with a preceding picture in display 
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order is low, because it is possible to generate a more precise predictive picture by inhibiting the direct mode and 
selecting other predicting method. 

[0051] Note that in this embodiment, the case that a motion vector used in the direct mode is calculated by scaling 
a referenced motion vector using a time interval between pictures is explained, however, the motion vector may be 
calculated by multiplying by a constant number. Here, a constant used for the multiplication may be variable when 
coding or decoding is performed on plural blocks basis or on plural pictures basis. 

[0052] Note that in Equation 7 (a) or 9 (b), when a motion vector MV_REF is calculated, after calculating the right 
side of Equation 7 (a) or 9 (b), the motion vector may be rounded to a predetermined motion vector precision. Precision 
of a motion vector includes half pixel precision, one-third pixel precision and quarter pixel precision or the like. In 
addition, the precision of a motion vector can be determined, for example, on block basis, picture basis, and sequence 
basis. Moreover, in Equations, 8 (a), 8 (b) and 9 (a), when motion vectors MV43, MV44 and MV46' are calculated, 
motion vectors may be rounded to a predetermined precision of a motion vector after calculating the right side of 
Equations 8 (a), 8 (b) and 9 (a). 

(Third Embodiment) 

[0053] A moving picture decoding method according to the third embodiment of the present invention is explained 
using the block diagram shown in Fig. 11. However, it is assumed that the bit stream generated in the picture coding 
method of the first embodiment is inputted. 

[0054] First, various information such as a prediction mode, motion vector information and predictive residual coding 
data is extracted from inputted bit stream by a bit stream analyzer 601 . 

[0055] The prediction mode and the motion vector information are outputted to a prediction mode/motion vector 
decoding unit 608 and a predictive residual coding data is outputted to a predictive residual decoding unit 602. The 
prediction mode/motion compensation decoding unit 608 decodes the prediction mode and a motion vector used in 
the prediction mode. When decoding the motion vector, a decoded motion vector stored in the motion vector storing 
unit 605 is used. Decoded prediction mode and motion vector are outputted to a motion compensation decoding unit 
604. In addition, decoded motion vector is stored in the motion vector storing unit 605 in order to be used for decoding 
motion vectors of following blocks. In the motion compensation decoding unit 604, a predictive picture is generated 
based on the inputted prediction mode and motion vector information using a decoded picture stored in a frame memory 
603 as a reference picture. A decoded picture is generated by inputting the above generated predictive picture into an 
add operating unit 606 and adding the inputted picture to the predictive residual picture generated in a predictive 
residual decoding unit 602. Above embodiment shows an operation for an inter-picture-prediction -bit stream, however, 
a switch 607 switches to a decoding process for an intra-picture-prediction-bit stream. 

[0056] A summary of a decoding flow is shown above, however, detailed process in the motion compensation de- 
coding unit 604 is explained below. 

[0057] Motion vector information is added on block basis or area (a divided block) basis. By using decoded pictures 
preceding and following a current picture to be coded in display order as reference pictures, a predictive picture to 
perform motion compensation from the pictures is generated. 

[0058] A direct mode is one of bi -predictions which perform inter picture prediction coding referring to each of pictu res 
preceding and following in display order. In the direct mode, since a current block to be coded inputs a bit stream which 
does not contain a motion vector directly, two motion vectors for actual motion compensation are calculated referring 
to a motion vector of a co-located block in a decoded picture close in display order, and a predictive picture is generated. 
[0059] Fig. 7 shows an operation when a decoded picture referred to in order to determine a motion vector in the 
direct mode contains two motion vectors which refer to preceding two pictures in display order. A picture P23 is a 
current picture to be decoded, and performs bi-prediction referring to pictures P22 and P24. When it is assumed that 
a block to be decoded is a block MB21 , two required motion vectors are determined using a motion vector of a co- 
located block MB22 in the decoded backward reference picture (the second reference picture specified by the second 
reference index) P24. Since the block MB22 contains two motion vectors MV21 and MV22 as the motion vectors, two 
required motion vectors MV23 and MV24 cannot be calculated by applying the direct scaling similarly to Equation 1. 
Therefore, as Equation 2, a motion vector MV_REF is determined as a motion vector to be scaled from an average 
value of two motion vectors of the block MB22, and a time interval TR_REF at that time is determined from an average 
value likewise. Then motion vectors MV23 and MV24 are calculated by scaling a motion vector MV_REF and a time 
interval TR_REF based on Equation 3. In this case, a time interval TR21 indicates a time interval between a picture 
P24 and P21 , that is, until a picture which is referred to by a motion vector MV21 , and a time interval TR22 indicates 
a time interval until a picture which is referred to by a motion vector MV22. A time interval TR23 indicates a time interval 
until a picture which is referred to by a motion vector MV23; and a time interval TR24 indicates a time interval until a 
picture which is referred to by a motion vector MV24. Time intervals between these pictures can be determined based 
on, for example, information indicating display time and display order added to each picture or difference of information. 
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Note that a current picture to be coded refers to a next picture in the example of Fig. 7, however, the case referring to 
a picture which is not next may be treated in the same manner. 

[0060] The above embodiment shows the decoding method in which an inter picture prediction decoding can be 
performed using the direct mode without contradiction even if a block of which motion vector is referred to belongs to 

5 a B picture. In the decoding method, when a block of which motion vector is referred to in the direct mode contains 
plural motion vectors which refer to a preceding picture, a motion vector is generated using the plural motion vectors, 
and two motion vectors to be used for actual motion compensation are determined by scaling. 
[0061] Note that when two motion vectors MV23 and MV24 in Fig. 7 are calculated, it is possible to use Equation 4 
instead of Equation 2 as a method for averaging motion vectors MV21 and MV22 and for averaging time intervals TR21 

10 andTR22in order to calculate the motion vector MV_REF and the time interval TR_REF to be scaled. First, as Equation 

4 (a) shows, the motion vector MV2V is calculated by scaling MV21 to equate the time interval with the motion vector 
MV22. Then the motion vector MV_REF is determined by averaging motion vectors MV21 ' and MV22. Here, the time 
interval TR22 is used directly as the time interval TR_REF. Note that the case calculating a motion vector MV22* by 
scaling the motion vector MV22 instead of calculating the motion vector MV21' by scaling a motion vector MV21 may 

15 be treated in the same manner. 

[0062] Note that when two motion vectors MV23 and MV24 in Fig. 7 are calculated, as a motion vector MV_REF 
and a time interval TR_REF which are scaled, a motion vector MV22 and a time interval TR22, which refer to a picture 
P22 located temporally closer to a picture P24 of which motion vector is referred to, can be directly used as Equation 

5 shows instead of using an average value of two motion vectors as Equation 2 shows. Likewise, as a motion vector 
20 MV_REF and a time interval TR_REF, a motion vector MV21 and a time interval TR21 , which refer to a picture P21 

located temporally farther can be directly used as Equation 6 shows. This method makes it possible to reduce capacity 
of a motion vector storing unit in a coding apparatus since each block belonging to a picture P24 of which motion vector 
is referred to can actualize motion compensation by storing only one of two motion vectors. 

[0063] Note that when two motion vectors MV23 and MV24 in Fig. 7 are calculated, as a motion vector MV_REF 
25 and a time interval TR_REF which are scaled, a motion vector which refers to a picture to be decoded precedently can 
be directly used, instead of using an average value of two motion vectors as Equation 2 shows. Fig. 8A shows a 
referential relation in display order of moving pictures as Fig. 7 shows, and Fig. 8B shows an order in which a bit stream 
is inputted, that is, a decoding order. Here, a picture P23 indicates a picture decoded in the direct mode, and a picture 
P24 indicates a picture of which motion vector is referred to for the decoding. When considering an order as shown in 
30 Rg. 8B, since a motion vector which refers to a picture to be decoded precedently is directly used, a motion vector 
M V22 and a time interval TR22 are directly applied as a motion vector M V_REF and a time interval TR_REF as Equation 
5 shows. Likewise, it is possible to directly use a motion vector which refers to a picture to be decoded later. In this 
case, a motion vector MV21 and a time interval TR21 are directly applied as a motion vector MV_REF and a time 
interval TR_REF as Equation 6 shows. This makes possible to reduce capacity of a motion vector storing unit in a 
35 decoding apparatus since each block belonging to a picture P24 of which motion vector is referred to can perform 
motion compensation by storing only one of two motion vectors. 

[0064] Note that in this embodiment, the case that a motion vector used in the direct mode is calculated by scaling 
a referenced motion vector using a time interval between pictures is explained, however, the motion vector may be 
calculated by multiplying by a constant number. Here, a constant used for the multiplication may be variable when 
40 coding or decoding is performed on plural blocks basis or on plural pictures basis. 

(Fourth Embodiment) 

[0065] An overview of coding process based on Fig. 1 1 is completely equal to the third embodiment. Here, a detailed 
45 operation of bi-prediction in the direct mode is explained using Fig. 9. However, it is assumed that the bit stream 
generated in the picture coding method of the first embodiment is inputted. 

[0066] Fig. 9 shows an operation when a picture referred to in order to determine a motion vector in the direct mode 
contains two motion vectors which refer to following two pictures in display order. A picture P43 is a current picture to 
be decoded, and performs bi-prediction referring to pictures P42 and P44. When it is assumed that a block to be 

50 decoded is a block MB41, two required motion vectors are determined using a motion vector of a co-located block 
MB42 in the decoded backward reference picture (the second reference picture specified by the second reference 
index) P44. Since the block MB42 contains two motion vectors MV45 and MV46 as motion vectors, two motion vectors 
MV43 and MV44 cannot be calculated by directly scaling similarly to Equation 1. Therefore, as Equation 7, a motion 
vector MV_REF is determined as a motion vector to be scaled from an average value of two motion vectors of the 

55 block MB42, and a time interval TR_REF at that time is determined from an average value likewise. Then motion 
vectors MV43 and MV44 are calculated by scaling a motion vector MV_REF and a time interval TR_REF based on 
Equation 8. In this case, a time interval TR45 indicates a time interval between a picture P44 and P45, that is, until a 
picture which is referred to by a motion vector MV45; and a time interval TR46 indicates a time interval between until 
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a picture which is referred to by a motion vector MV46. A time interval TR43 indicates a time interval until a picture 
which is referred to by a motion vector MV43; and a time interval TR44 indicates a time interval until a picture which 
is referred to by a motion vector MV44. Note that a current picture to be decoded refers to a next picture in the example 
of Fig. 9, however, the case referring to a picture which is not adjacent may be treated in the same manner 

5 [0067] The above embodiment shows the decoding method in which an inter picture prediction decoding can be 
performed using the direct mode without contradiction even if a block of which motion vector is referred to in the direct 
mode belongs to a B picture. In the decoding method, when a block of which motion vector is referred to in the direct 
mode contains plural motion vectors which refer to a following picture in display order, a motion vector is generated 
using the plural motion vectors, and two motion vectors to be used for actual motion compensation are determined by 

10 scaling. 

[0068] Note that when two motion vectors MV43 and MV44 in Fig. 9 are calculated, it is possible to use Equation 7 
instead of Equation 9 as a method for averaging motion vectors MV45 and MV46 and for averaging time intervals TR45 
andTR46 in order to calculate the motion vector MV_REF and the time interval TR_REF to be scaled. First, as Equation 

9 (a) shows, the motion vector MV46' is calculated by scaling MV46 to equate the time interval with the motion vector 
15 MV45. Then the motion vector MV_REF is determined by averaging motion vectors MV46* and MV45. Here, the time 

interval TR45 is used directly as the time interval TR_RF. Note that the case calculating a motion vector MV45' by 
scaling the motion vector MV45 instead of calculating the motion vector MV46' by scaling a motion vector MV46 may 
be treated in the same manner. 

[0069] Note that when two motion vectors MV43 and MV44 in Fig. 9 are calculated, as a motion vector MV_REF 
20 and a time interval TR_REF which are scaled, a motion vector MV45 and a time interval TR45, which refer to a picture 
P45 located temporally closer to a picture P44 of which motion vector is referred to, can be directly used, as Equation 

10 shows, instead of using an average value of two motion vectors as Equation 7 shows. Likewise, as a motion vector 
MV_REF and a time interval TR_REF, a motion vector MV46 and a time interval TR46, which refer to a picture P46 
located temporally farther can be directly used as Equation 11 shows. This method makes it possible to reduce capacity 

25 of a motion vector storing unit in a decoding apparatus since each block belonging to a picture P44 of which motion 
• vector is referred to can implement motion compensation by storing only one of two motion vectors. 
[0070] Note that when two motion vectors MV43 and MV44 in Fig. 9 are calculated, as a motion vector MV_REF 
and a time interval TR_REF which are scaled, a motion vector which refers to a picture to be decoded precedentiy can 
be directly used, instead of using an average value of two motion vectors as Equation 7 shows. Fig. 10A shows a 

30 referential relation in display order of moving pictures as Fig. 9 shows and Fig. 10B shows an order in which a bit 
stream is inputted, that is, a decoding order. Here, a picture P43 indicates a picture which is decoded in the direct 
mode, and a picture P44 indicates a picture of which motion vector is referred to for the decoding. When considering 
an order as shown in Fig. 10B, since a motion vector which refers to a picture to be decoded precedentiy is directly 
used, a motion vector MV46 and a time interval TR46 are directly applied as a motion vector MV_REF and a time 

35 interval TR_REF as Equation 1 0 shows. This method makes it possible to reduce capacity of a motion vector storing 
unit in a decoding apparatus since each block belonging to a picture P44 of which motion vector is referred to can 
perform motion compensation by storing only one of two motion vectors. 

[0071] Note that when a block which is referred to in order to determine a motion vector in the direct mode contains 
two motion vectors which refer to two following pictures in display order, it is possible to perform motion compensation 
40 assuming that two required motion vectors MV43 and MV44 are "0". This method makes It possible to reduce capacity 
of a motion vector storing unit in a decoding apparatus and further makes it possible to omit a process of calculating 
a motion vector since each block belonging to a picture P44 of which motion vector is referred to does not have to 
store a motion vector. 

[0072] Note that in this embodiment, the case that a motion vector used in the direct mode is calculated by scaling 
45 a referenced motion vector using a time interval between pictures is explained, however, the motion vector may be 
calculated by multiplying by a constant number. Here, a constant used for the multiplication may be variable when 
coding or decoding is performed on plural blocks basis or on plural pictures basis. 

(Fifth Embodiment) 

50 

[0073] Coding/decoding method can be actualized not only by the coding/decoding method shown in the above first 
embodiment through fourth embodiment, but also by a motion vector calculation method shown below. 
[0074] Fig. 12 shows an operation when a coded or decoded block referred to in order to calculate a motion vector 
in the direct mode contains two motion vectors which refer to preceding two pictures in display order. A picture P23 is 
55 a current picture to be coded or decoded. When it is assumed that a block to be coded or decoded is a block MB1 , 
two required motion vectors are determined using a motion vector of a co-located block MB2 in the coded or decoded 
backward reference picture (the second reference picture specified by the second reference index) P24. Note that in 
Fig. 12, the block MB1 is a current block of process, the blocks MB1 and MB2 are co-located blocks in other pictures, 
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and the motion vectors MV21 is first forward motion vector that the reference picture is specified by first reference 
index and MV22 is forward motion vector that the reference picture is specified by second reference index, and these 
motion vectors are used for coding or decoding the block MB2 and refer to pictures P21 and P22 respectively. The 
pictures P21 , P22 and P24 are coded or decoded pictures. A time interval TR21 is a time interval between the picture 
5 P21 and the picture P24; a time interval TR22 is a time interval between the picture P22 and the picture P24; a time 
interval TR2V is a time interval between P21 and the picture P23; and a time interval TR 24' is a time interval between 
the picture P23 and the picture P24. 

[0075] In the motion vector calculation method, as shown in Fig. 12, only the forward motion vector (the first motion 
vector) MV21 coded or decoded precedently is used out of motion vectors of the block MB2 in the reference picture 
10 P24; and a motion vectors MV2V and MV24' of the block MB1 are calculated by following equations. 

MV21 ' = MV21 x TR21 '/TR21 

MV24' = -MV21 x TR24YTR21 

[0076] Then bi-prediction is performed from the pictures P21 and P24 using the motion vectors MV21' and MV24V 
Note that a motion vector of the block MB1 may be calculated using only a motion vector (the second motion vector) 

20 MV22 coded or decoded later out of motion vectors of the block MB2 in the reference picture P24, instead of calculating 
motion vectors MV21 ' and MV24' of the block MB1 using only the motion vector MV21 . Moreover, as shown in the first 
embodiment through the fourth embodiment, a motion vector of the block MB1 may be determined using both the 
motion vectors MV21 and MV22. When selecting one of the motion vectors MV21 and MV22, a motion vector of a 
block coded or decoded precedently may be selected, and it may be set arbitrarily in a coding apparatus and a decoding 

25 apparatus. Motion compensation is possible either when the picture P21 is in the short term reference picture buffer 
or in the long term reference picture buffer. Explanation will be given for the short term reference picture buffer and 
the long term reference picture buffer later. 

[0077] Fig. 13 shows an operation when a coded or decoded block referred to in order to calculate a motion vector 
in the direct mode contains two motion vectors which refer to following two pictures in display order. A picture ~P22 is 

30 a current picture to be coded or decoded. When it is assumed that a block to be coded or decoded is a block MB1 , 
two required motion vectors are determined using a motion vector of a co-located block MB2 in the coded or decoded 
backward reference picture (the second reference picture) P23. Note that in Fig. 13 the block MB1 is a current block 
of processing, the blocks MB1 and MB2 are co-located blocks in pictures, and the motion vectors MV24 and MV25 
are backward motion vectors used for coding or decoding the block MB2 and refer to pictures P21 and P22, respectively. 

35 The pictures P21 , P23, P24 and P25 are coded or decoded pictures. A time interval TR24 is a time interval between 
the picture P23 and the picture P24, a time interval TR25 is a time interval between the picture P23 and the picture 
P25, a time interval TR24' is a time interval between P22 and the picture P24, and a time interval TR 21 1 is a time 
interval between the picture P21 and the picture P22. 

[0078] In a motion vector calculation method, as shown in Fig. 13, only the backward motion vector MV24, which 
40 refers to the picture P24, of the block MB2 in the reference picture P23 is used, and a motion vectors MV21 ' and MV24* 
are calculated by following equations. 

MV21 1 = -MV24 x TR21 7TR24 

45 

MV24' = MV24 x TR24'/TR24 

[0079] Then bi-prediction is performed from the pictures P21 and P24 using the motion vectors MV2V and MV24\ 
50 [0080] Note that, as shown in Fig. 1 4, when only a backward motion vector MV 25, which points at the picture P25, 
of the block MB2 in the reference picture P23 is used, motion vectors MV21 ' and MV24' are calculated by following 
equations. Here, a time interval TR24 is a time interval between the picture P23 and the picture P24; a time interval 
TR25 is a time interval between the picture P23 and the picture P25; a time interval TR25' is a time interval between 
the picture P22 and the picture P25; and a time interval TR21 1 is a time interval between the picture P21 and the picture 
55 P22. 

MV21 ' = -MV25 X TR21 7TR25 
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MV25' = MV25 x TR25'/TR25 

[0081] Then bi-prediction is performed from the pictures P21 and P24 using the motion vectors MV2V and MV24\ 
[0082] Fig. 15 shows an operation when a coded or decoded block referred to in order to calculate a motion vector 
in the direct mode contains two motion vectors which refer to a preceding picture in display order. A picture P23 is a 
current picture to be coded or decoded. When it is assumed that a block to be coded or decoded is a block MB1 , two 
required motion vectors are determined using a motion vector of a co-located block MB2 in the coded or decoded 
backward reference picture (the second reference picture specified by the second reference index) P24. Note that in 
Fig. 1 5, the block MB1 is a current block of processing, the blocks MB1 and MB2 are co-located blocks in other pictures. 
The motion vectors MV21 A and MV21B are forward motion vectors used for coding or decoding the block MB2, and 
both refer to the picture P21 . The pictures P21 , P22 and P24 are coded or decoded pictures. Time intervals TR21 A 
and TR 21 B are a time interval between the picture P21 and the picture P24; a time interval TR21' is a time interval 
between the picture P21 and the picture P23; and a time interval TR24' is a time interval between P23 and the picture 
P24. 

[0083] In a motion vector calculation method, as shown in Fig. 15, only the forward motion vector MV21 A, which 
points at the picture P21, of the block MB2 in the reference picture P24 is used, and a motion vectors MV21A' and 
MV24' are calculated by following equations. 

MV21A' = MV21AXTR217TR21A 



MV24' = -MV21 A X TR247TR21 A 

[0084] Then bi-prediction is performed from the pictures P21 and P24 using the motion vectors MV21 A' and MV24V 
[0085] Note that a motion vector of the block MB1 may be calculated using only a forward motion vector MV21B, 
which points at the picture P21 , of the block MB2 in the reference picture P24. Moreover, as shown in the first embod- 
iment through the fourth embodiment, a motion vector of the block MB1 may be determined using both forward motion 
vectors MV21A and MV21 B. When selecting one of the forward motion vectors MV21A and MV21B, a motion vector 
of a block coded or decoded precedently (described earlier in a bit stream) may be selected, and it may be set arbitrarily 
by a coding apparatus and a decoding apparatus. Here, the motion vector coded or decoded precedently means the 
first motion vector. Motion compensation is possible either when the picture P21 in the short term reference picture 
buffer or in the long term reference picture buffer. Explanation will be given for the short term reference picture buffer 
and the long term reference picture buffer later. 

[0086] Note that in this embodiment, the case that a motion vector used in the direct mode is calculated by scaling 
a referenced motion vector using a time interval between pictures is explained, however, the motion vector may be 
calculated by multiplying by a constant number. Here, a constant used for the multiplication may be variable when 
coding or decoding is performed on plural blocks basis or on plural pictures basis. 

[0087] Note that, in the above-mentioned equations to calculate motion vectors MV21', MV24\ MV25' and MV21A', 
motion vectors may be rounded to a predetermined precision of a motion vector after calculating the right side of the 
equations. Precision of motion vector includes half pixel precision, one-third pixel precision and quarter pixel precision 
or the like. In addition, precision of a motion vector can be determined, for example, on block basis, picture basis, and 
sequence basis. 

(Sixth Embodiment) 

[0088] In this sixth embodiment, a method for calculating a current motion vector by scaling only one of two forward 
motion vectors, which refer to two pictures preceding in display order, is explained using Figs. 14, 15 and 16. In this 
case, a reference picture used to in order to determine a current motion vector in the direct mode contains the two 
forward motion vectors. Note that the block MB1 is a current block to be processed; the blocks MB1 and MB2 are co- 
located blocks in other pictures; and the motion vectors MV21 and MV22 are forward motion vectors used for coding 
or decoding the block MB2, and refer to pictures P21 and P22, respectively. The pictures P21 , P22 and P24 are coded 
or decoded pictures. A time interval TR21 is a time interval between the picture P21 and the picture P24; a time interval 
TR22 is a time interval between the picture P22 and the picture P24; a time interval TR21' is a time interval between 
P21 and the picture P23; and a time interval TR22' is a time interval between P22 and the picture P23. 
[0089] As the first method, when a block MB2 in a reference picture P24 contains a forward motion vector MV21 
referring to a picture P22 and a forward motion vector MV22 referring to a picture P23 as shown in Fig. 1 6, a motion 
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vector MV22' of the block MB1 is calculated using only a motion vector MV22 which refers to a picture P22 close to a 
current picture P23 in display order by a following equation. 

5 MV22' = MV22 x TR22'/TR22 

[0090] Then motion compensation is performed from the picture P22 using the motion vector MV22'. 
[0091] As the second method, when a block MB2 of a reference picture P24 contains a forward motion vector MV21 
referring to a picture P21 and a forward motion vector MV22 referring to a picture P22 as shown in Fig. 17, a motion 
10 vector MV2V of the block MB1 is calculated using only a motion vector MV21 which refers to a picture P21 being far 
from a current picture P23 in display order by a following equation. 

MV21' = MV21 XTR217TR21 

15 

[0092] Then motion compensation is performed from the picture P21 using the motion vector MV21'. 
[0093] The first and the second methods make it possible to reduce capacity of a motion vector storing unit since 
the block MB2 belonging to a picture P24 of which motion vector is referred to can actualize motion compensation by 
storing only one of two motion vectors. 
20 [0094] Note that motion compensation can be performed from a picture P22 close in display order, using the forward 
motion vector MV21 same as the first embodiment A motion vector MVN (not shown in this figure) used for the motion 
compensation is calculated by a following equation. 

25 MVN = MV21 x TR227TR21 

[0095] As the third method, as shown in Fig.1 8, a motion compensation block is obtained from the pictures P21 and 
P22 respectively using the motion vectors MV21' and MV22' calculated above, and an average picture is used as an 
interpolation picture in motion compensation. 

30 [0096] The third method increases calculated amount, however, improves precision of motion compensation. 

[0097] Moreover, it is possible to obtain a motion compensation block from the picture P22 using the above-mentioned 
motion vectors MVN and MV22', and to use an average picture as an interpolation picture in motion compensation. 
[0098] Note that in this embodiment, the case that a motion vector used in the direct mode is calculated by scaling 
a referenced motion vector using a time interval between pictures is explained, however, the motion vector may be 

35 calculated by multiplying a reference motion vector by a constant number. Here, a constant used for the multiplication 
may be variable when coding or decoding is performed on plural blocks basis or on plural pictures basis. 
[0099] Note that, in the above-mentioned equations to calculate the motion vectors MV21', MV22" and MVN, the 
motion vectors may be rounded to a predetermined precision of a motion vector after calculating the right side of the 
equations. Precision of motion vector includes half pixel precision, one-third pixel precision and quarter pixel precision 

40 or the like. In addition, precision of a motion vector can be determined, for example, on block basis, picture basis, and 
sequence basis. 

(Seventh Embodiment) 

45 [0100] In the above sixth embodiment, the case when a reference picture used to determine a motion vector of a 
current block to be coded or decoded contains two forward motion vectors in the direct mode is described. The two 
forward motion vectors refer to two preceding pictures in display order. However, when the reference picture contains 
two backward motion vectors which refer to two following pictures in display order, it is possible to calculate a current 
motion vector by scaling only one of two backward motion vectors (the second motion vectors of which reference 

50 picture is specified by the second reference indices), likewise. Explanation will be given using Figs. 17 — 20 below. 
Note that the block MB1 is a current block of process, the blocks MB1 and MB2 are co-located blocks in other pictures, 
and the motion vectors MV24 and MV25 are backward motion vectors (the second motion vectors of which reference 
picture is specified by the second reference indices) used for coding or decoding the block MB2. The pictures P21 , 
P23, P24 and P25 are coded or decoded pictures. A time interval TR24 is a time interval between the picture P23 and 

55 the picture P24; a time interval TR25 is a time interval between the picture P23 and the picture P25; a time interval 
TR24' is a time interval between P22 and the picture P25; and a time interval TR25' is a time interval between P22 
and the picture P25. 

[0101] As the first method, when a block MB2 of a reference picture P23 contains two backward motion vectors 
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MV24 referring to a picture P24 and MV25 referring to a picture P25 as shown in Fig. 1 9, a motion vector MV24' of the 
block MB1 is calculated using only a backward motion vector MV24 which refers to a picture P24 being temporally 
close to a current picture P22 by a following equation. 

MV24' = MV24 x TR24'/TR24 

[0102] Then motion compensation is performed from the picture P24 using the motion vector MV24'. 
[0103] Note that motion compensation can be performed from a picture P23 close in display order, using a backward 
fo motion vector MV24 same as the first embodiment. A motion vector MVN1 (not shown in this figure) used for the motion 
compensation is calculated by a following equation. 

MVN 1 = MV24 x TRN1/TR24 

15 

[0104] As the second method, when a block MB2 of a reference picture P23 contains two backward motion vectors 
MV24 referring to a picture P24 and MV25 referring to a picture P25 as shown in Fig. 20, a motion vector MV25' of the 
block MB1 is calculated using only a backward motion vector MV25 which refers to a picture P25 far from a current 
picture P23 in display order by a following equation. 

20 

MV25 1 = MV25 x TR25'/TR25 

[0105] Then motion compensation is performed from the picture P25 using the motion vector MV25\ 
25 [0106] The first and the second methods make it possible to reduce capacity of a motion vector storing unit since 
the block MB2 belonging to a picture P23 of which motion vector is referred to can implement motion compensation 
by storing only one of two motion vectors. 

[01 07] Note that a motion compensation can be performed from a picture P23 close in display order, using a backward 
motion vector MV25 as same as the first embodiment. A motion vector MVN2 (not shown in this figure) used for the 
30 motion compensation is calculated using a following equation. 

MVN2 = MV25 x TRN1/TR25 

35 [01 08] As the third method, as shown in Fig. 21 , a motion compensation block is obtained from the pictures P24 and 
P25 respectively using the motion vectors MV24' and MV25' calculated above, and an average picture is used as an 
interpolation picture in a motion compensation. 

[0109] The third method increases the amount of calculation, however, improves precision of motion compensation. 
[0110] Note that, it is possible to obtain a motion compensation block from the picture P24 using the above-mentioned 
40 motion vectors MVN1 and MVN2, and use an average picture as an interpolation picture in motion compensation. 

[01 1 1] Moreover, as shown in Fig. 22, when a reference picture referred to in order to determine a motion vector of 
a current motion vector in the direct mode contains a backward motion vector which refers to a picture following in 
display order, for example, a motion vector MV24' is calculated using a following equation. 

MV24' = MV24 x TR24'/TR24 

[0112] Then motion compensation is performed from the picture P24 using the motion vector MV24 1 . 
[0113] Note that motion compensation can be performed from a picture P23 close in display order, using a backward 
so motion vector MV25 as same as the first embodiment. A motion vector MVN3 (not shown in this figure) used for the 
motion compensation is calculated by a following equation. 

MVN3 = MV24 x TRN1/TR24 

55 

[0114] Note that in this embodiment, the case when a current motion vector is calculated by scaling the backward 
motion vector is explained, when containing two backward motion vectors, which refer to two pictures following in 
display order, and when containing a backward motion vector, which refers to a picture following in display order. 
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However, a current motion vector may be calculated referring to a motion vector of a neighboring block in a same 
picture without using a backward motion vector, and when intra picture coding is performed, a current motion vector 
may be calculated referring to a motion vector of a neighboring block in a same picture. 
[0115] To begin with, the first calculation method will be described. 

5 [01 1 6] Fig. 23 shows a positional relation between a motion vector to be referred to and a current block. A block MB 
1 is a current block, and refers to a motion vector of a block including three pixels located on A, B and C. Note that 
when a pixel C cannot be referred to since it locates outside of a frame or it has not been coded/decoded, a motion 
vector of a block including a pixel D is used instead of a block including the pixel C. BY calculating a median value of 
motion vectors of three current blocks including pixels A, B and C to be referred to, a motion vector used actually in 

10 the direct mode is determined. By calculating a median value of motion vectors of three blocks, additional information 
showing which motion vector is selected is not necessary to be described in a bit stream. Hence, it is possible to obtain 
a motion vector expressing motion close to actual motion of the block MB1 . In this case, motion compensation may 
be performed by only forward reference (reference to the first reference picture) using the determined motion vector 
and by bi-directional reference (reference to the first reference picture and the second reference picture) using a motion 

15 vector parallel with the determined motion vector. 

[0117] Next, the second calculation method will be described. 

[0118] Under the second calculation method, an median value is not selected as the first calculation method, and a 
motion vector used in actual direct mode is determined by selecting a motion vector of which coding efficiency is the 
highest of motion vectors of three blocks including pixels A, B and C. In this case, motion compensation may be 

20 performed by only forward reference (reference to the first reference picture), using the determined motion vector and 
by bi-directional reference (reference to the first reference picture and the second reference picture) using a motion 
vector parallel with the determined motion vector. Information indicating a motion vector with the highest coding effi- 
ciency is, for example as shown in Fig. 24A, added to a header area of a block in a bit stream generated by a bit stream 
generating unit 103 with information indicating a direct mode outputted from a mode selecting unit 107. Note that as 

25 shown in Fig. 24B, the information indicating a motion vector with the highest coding efficiency may be added to a 
header area of a macrobiock. Here, information indicating a motion vector with the highest coding efficiency is, for 
example, a number identifying a block including a current pixel to be referred to, and an identification number given to 
every block. When a block is identified by the identification number, a motion vector with the highest coding efficiency 
maybe indicated by using only one of motion vectors used for coding a block corresponding to an identification number, 

30 and when motion vectors are more than 1 , a motion vector with the highest coding efficiency may be indicated by using 
plural motion vectors. Or, a motion vector with the highest coding efficiency may be indicated by using an identification 
number given to every block to every motion vector in bi-direction (reference to the first reference picture and the 
second reference picture). This selecting method makes it possible to always select a motion vector which makes 
coding efficiency the highest. However, since additional information showing which motion vector is selected needs to 

35 be described in a bit stream, extra amount of code for the additional information is necessary. In addition, the third 
calculation method is explained. 

[0119] Under the third calculation method, a motion vector referring to a reference picture with the smallest reference 
index is determined as a motion vector used in an actual direct mode. The smallest reference index means generally 
a motion vector which refers to a picture close in display order or a motion vector with the highest coding efficiency. 
40 Therefore, this motion vector selecting method makes it possible to improve coding efficiency, since a motion vector 
used in the direct mode is generated using a motion vector which refers to a picture closest in display order or a motion 
vector with the highest coding efficiency. 

[0120] Note that when all of three motion vectors refer to a same reference picture, a median value of the three 
motion vectors may be used. On the other hand, when two of three motion vectors refer to a reference picture with the 

45 smallest reference index value, for example, same one of the two motion vectors may be always selected. As an 
example, referring to Fig. 23, there are three blocks including pixels A, B and C respectively, and when reference index 
values of blocks including pixels A and B are the smallest, and a same reference picture is referred to, a motion vector 
in the block including the pixel A may be selected. However, when reference index values of blocks including pixels A 
and C are the smallest, and a same reference picture is referred to, a motion vector in a block BL1 including the pixel 

50 a located closer to a block may be selected. 

[0121] Note that the above-mentioned median value may be a median value of components in horizontal direction 
and vertical direction of each motion vector, and may be a median value of value (absolute value) of each motion vector. 
[0122] In the case as shown in Fig. 25, a median value of motion vectors may be a median value of motion vectors 
contained in 5 blocks: a co-located block of a block BL1 in a following reference picture; blocks including pixels A, B 

55 and C respectively;-M-+ and a block including a pixel D shown in Fig. 25. As described above, when a co-located block, 
which is close to a current pixel to be coded, of the block BL1 in a following reference picture is used, a process of 
calculating a median value of motion vectors becomes easier by using a block including the pixel D in order to make 
the number of blocks an odd number. Note that when plural blocks are on a co-located area of the block BL1 in a 



19 



EP 1 411 729 Page 20 of 97 



EP1 411 729 A1 

following reference picture, a motion compensation may be performed for the block BL1 by using a motion vector in a 
block which occupies the largest area overlapped with the block BL1 , or by dividing the block BL1 corresponding to 
area of the plural block in the following reference picture and a motion compensation may be performed on divided 
block basis. 

s [0123] An explanation will be further given using concrete examples. 

[0124] As shown in Figs. 26 and 27, when all blocks including pixels A, B and C contain a motion vector which refers 
to a picture preceding a current picture to be coded, any of the above-mentioned the first through the third calculation 
methods may be used. 

[0125] Likewise, as shown in Figs. 28 and 29, when all blocks including pixels A, B and C contain a motion vector 
10 which refers to a picture following a current picture to be coded, any of the first through the third calculation methods 
may be used. 

[0126] Next, the case shown in Fig. 30 is explained. Fig. 30 shows the case that each of blocks including pixels A, 
B and C respectively contains each of motion vectors, one refers to picture preceding a current picture to be coded 
and another refers to a picture following a current picture to be coded. 

is [0127] According to the first calculation method, a forward motion vector used for motion compensation of the block 
BL1 is selected by a median value of motion vectors MVAf , MVBf and MVCf, and a backward motion vector used for 
motion compensation of the block BL1 is selected by a median value of motion vectors MVAb, MVBb and MVCb. Here, 
the motion vector MVAf is a forward motion vector of a block containing the pixel A, the motion vector MVAb is a 
backward motion vector of a block containing the pixel A, the motion vector MVBf is a forward motion vector of a block 

20 containing the pixel B, the motion vector MVBb is a backward motion vector of a block containing the pixel B, the motion 
vector MVCf is a forward motion vector of a block containing the pixel C, the motion vector MVCb is a backward motion 
vector of a block containing the pixel C. Motion vectors such as the motion vector MVAf are not limited to the case 
referring to a picture as shown in the figure. Same applies to a following explanation. 

[0128] According to the second calculation method, a motion vectorto be used in the actual direct mode is determined 

25 by selecting a motion vector with the highest coding efficiency of forward reference motion vectors MVAf, MVBf and 
MVCf, and selecting a motion vector with the highest coding efficiency of backward reference motion vectors MVAb, 
MVBb and MVCb. In this case, motion compensation may be performed by only forward reference using a motion 
vector with the highest coding efficiency of forward reference motion vectors MVAf, MVBf and MVCf, and by bi-prediction 
using a motion vector parallel with the determined motion vector. Note that, in order to achieve the highest coding 

30 efficiency, a motion compensation may be performed by selecting one block and using forward and backward reference 
motion vectors of the selected block instead of selecting for forward and backward reference motion vectors respec- 
tively. In this case, since information indicating a selection can be reduced as compared with the case that selecting 
information indicating a block which contains a pixel having a forward motion vector selected for the highest coding 
efficiency and a block which contains a pixel having a backward motion vector selected for the highest coding efficiency, 

35 coding efficiency can be improved. The selection of the block may be from followings: 1 . A block includes a pixel having 
a forward reference motion vector which refers to a reference picture with the smallest reference index value; 2. A 
block has the smallest value which is a sum of a reference index value of a picture referred to by a forward reference 
motion vector of a block including each pixel and a reference index value of a picture referred to by a backward reference 
motion vector of a block including each pixel; 3. A block selects a median value of reference indices of a picture referred 

40 to by a forward reference motion vector and includes a pixel having a forward reference motion vector with the selected 
median value, and a backward motion vector is included in the block; and 4. A block selects a median value of reference 
indices in a picture referred to by a backward reference motion vector and includes a pixel having a backward motion 
vector with the selected median value, and a forward motion vector is included in the block. Note that when each of 
backward motion vectors refers to a same picture, selecting the method 1 and the method 3 are appropriate. 

45 [0129] According to the third calculation method, one of forward reference motion vectors MVAf, MVBf and MVCf, 
which refers to a reference picture with the smallest reference index value, is a forward reference (the first reference) 
motion vector used in the direct mode. Or, one of backward reference motion vectors MVAb, MVBb and MVCb, which 
refers to a reference picture with the smallest reference index value, is a backward reference (the second reference) 
motion vector used in the direct mode. Note that, in the third calculation method, the forward motion vector referring 

so to the reference picture with the smallest reference index is a forward motion vector of a block BL1 , and the backward 
motion vector referring to the reference picture with the smallest reference index is a backward motion vector of the 
block BL1 , however, two motion vectors BL1 and BL2 may be derived using one of a forward motion vector or a 
backward motion vector referring to a reference picture with the smallest reference index, and motion compensation 
may be performed using the derived motion vector. 

55 [0130] Next, the case shown in Fig. 31 is explained. Fig. 31 shows a case the pixel A contains each of motion vectors, 
one refers to preceding picture and another refers to a following picture, the pixel B contains only a motion vector which 
refers to a preceding picture, and the pixel C contains only a motion vector which refers a following picture. 
[0131] Thus, when there is a block including a pixel containing only a motion vector referring a picture uni-directionally, 
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assuming that a motion vector referring to a picture in another direction is 0, the calculation method in above-mentioned 
Rg.30 may be used for motion compensation. Specifically, using the first or the third calculation method in Fig. 30, 
calculation may be performed assuming MVCf = MVBb = 0. That is, in the first calculation method, when a forward 
motion vector of the block BL1 is calculated, assuming that a motion vector MVCf of the pixel C referring to a preceding 
picture is 0, a median value of motion vectors MVAf, MVBf and MVCf is calculated. On the other hand, when a backward 
motion vector of the block BL1 is calculated, assuming a motion vector MVBb, which refers to a following picture, of 
the pixel B is 0, a median value of motion vectors MVAb, MVBb and MVCb is calculated. 

[0132] According to the third calculation method, assuming that a motion vector MVCf, which refers to a preceding 
picture, of the pixel C and a motion vector, which refers to a following picture, of the pixel B are 0, a motion vector, 
which refers to a reference picture with the smallest reference index value, of the block BL1 is calculated. For example, 
when a block including a pixel A refers to a picture with the first reference index of M 0" and a block including a pixel B 
refers to a picture with the first reference index of "1", the smallest value of reference index is M 0". Therefore, since 
only the motion vector MVBf, which refers to a preceding picture, of the block including the pixel refers to a picture with 
the smallest first reference index, the motion vector MVBf is selected as a forward motion vector of the block BL1 . 
Moreover, for example, when both pixels A and C refer to a following picture with the smallest second reference index, 
for example "0", assuming a motion vector MVBb, which refers to a following picture, of a pixel B is 0, a median value 
of motion vectors MVAb, MVBb and MBCb is calculated. The motion vector resulted from the calculation is set as a 
forward motion vector of the block BL1 . 

[0133] Next, a case shown in Fig. 32 is explained. Fig. 32 shows a case when the pixel A contains each of motion 
vectors; one refers to preceding picture and another refers to a following picture. The pixel B contains only a motion 
vector which refers to a preceding picture and the pixel C does not contain a motion vector and is intra-picture coded. 
[0134] When a block including a current pixel C to be coded is intra-picture coded, assuming that motion vectors 
which refer to pictures preceding and following the block are both 0, the calculation method in above-mentioned Fig. 
30 may be used for motion compensation. Specifically, calculation may be performed assuming MVCf = MVCb = 0. 
Note that, in Fig. 30, MVBb is 0. 

[0135] Lastly, a case shown in Fig. 33 is explained. Fig. 33 shows a case that a pixel C is coded by the direct mode. 
[0136] When blocks including a current pixel referred to contain a block coded by the direct mode, motion compen- 
sation of the block BL1 may be performed using a motion vector used for coding a block coded by the direct mode and 
using the calculation method shown in Fig. 30. 

[0137] Note that it is determined which of a forward or a backward reference motion vector is used depending on a 
picture to be referred to, a picture to be coded and time information included in each picture. Therefore, when a motion 
vector is derived after differentiate between a forward reference and a backward reference, a motion vector of each 
block is judged if a forward reference or a backward reference from time information contained in each picture. 
[0138] In addition, an example of a calculation method combining the above-mentioned calculation methods is ex- 
plained. Fig. 34 is an illustration showing a procedure for determining a motion vector to be used in the direct mode. 
Fig. 34 is an example of a method for determining a motion vector using reference indices. Note that RidxO and Ridxl 
shown in Fig. 34 are reference indices explained above. Fig. 34A shows a procedure for determining a motion vector 
using the first reference index RidxO, and Fig. 34B shows a procedure for determining a motion vector using the second 
reference index Ridxl . First, Fig. 34A is explained. 

[0139] In a step S3701 , there are three blocks including pixels A, B and C respectively, and the number of blocks 
referring to a picture using the first reference index RidxO is calculated. 

[0140] When the number of blocks calculated in the step S3701 is "0", the number of blocks referring to a picture 
using the second reference index Ridxl is further calculated in a step S3702. When the number of blocks calculated 
in the step S3702 is "O", motion compensation is performed bi-directionally for a current block to be coded assuming 
that a motion block of the current block to be coded is "0" in a step S3703. On the other hand, when the number of 
blocks calculated in the step S3702 is M T or more, a motion vector of .a current block to be coded is determined in a 
step S3704 by the number of blocks containing the second reference index Ridxl . For example, motion compensation 
of a current block to be coded is performed using the motion vector determined by the number of blocks containing 
the second reference index Ridxl . 

[0141] When the number of blocks calculated in the step S3701 is "1 M , a motion vector containing the first reference 
index RidxO is used in a step S3705. 

[0142] When the number of blocks calculated in the step S3701 is n 2", a motion vector corresponding to a median 
value of three motion vectors is used in a step S3706, assuming that a block which does not contain the first reference 
index RidxO contains a motion vector of MV = 0 of the first reference index RidxO. 

[0143] When the number of blocks calculated in the step S3701 is "3", a motion vector corresponding to a median 
value of three motion vectors is used in a step S3707. Note that motion compensation in the step S3704 may be 
performed bi-directionally using one motion vector. Here, bi-directional motion compensation may be performed after 
calculating a motion vector in the same direction as one motion vector and a motion vector in the opposite direction to 
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one motion vector, for example, by scaling one motion vector, or may be performed using a motion vector in the same 
direction as one motion vector and a motion vector of "0°. Next, Fig. 34B is explained. 
[0144] The number of blocks containing the second reference index Ridxl is calculated in a step S3711 . 
[0145] When the number of blocks calculated in the step S3711 is "0 a , the number of blocks containing the first 
reference index R!xd1 is further calculated in a step S3712. When the number of blocks calculated in the step S3712 
is "0°, motion compensation is performed bi-directionally for a current block to be coded assuming that a motion block 
of the current block to be coded is "0" in a step S371 3. On the other hand, the number of blocks calculated in the step 
S3712 is "1 W or more, a motion vector of a current block to be coded is determined in a step S3714 by the number of 
blocks containing the first reference index RidxO. For example, motion compensation of a current block to be coded is 
performed using the motion vector determined by the number of blocks containing the first reference index RidxO. 
[01 46] When the number of blocks calculated in the step S371 1 is T, a motion vector containing the second reference 
index Ridxl is used in a step S3715. 

[0147] When the number of blocks calculated in the step S3711 is "2", a motion vector corresponding to a median 
value of three motion vectors is used in a step S3716, assuming that a block which does not contain the second 
reference index Ridxl contains a motion vector of MV = 0 of the second reference index Ridxl . 
[0148] When the number of blocks calculated in the step S3711 is u 3 n , a motion vector corresponding to a median 
value of three motion vectors is used in a step S3717. Note that motion compensation in the step S3714 may be 
performed bi-directionally using one motion vector. Here, bi-directional motion compensation may be performed after 
calculating a motion vector in the same direction as one motion vector and a motion vector in the opposite direction to 
one motion vector, for example, by scaling one motion vector, or may be performed using a motion vector in the same 
direction as one motion vector and a motion vector of "O". 

[0149] Note that Figs. 34A and 34B are explained respectively, but both methods may be used or one of those 
methods may be used. However, when one of those methods is used, for example, when a process started form the 
step 3704 shown in Fig. 34A is used and a process up to the step S3704 is used, a process after the step S3711 shown 
in Fig. 34B may be used. When a process up to the step S3704 is used, since a process after the step S3712 is not 
used, amotion vector can be determined uniquely. When both processes of Figs. 34A and 34B are used, either process 
may be used first, or two processes may be used together. When a block in the neighbor of a current block to be coded 
is coded in the direct mode, a reference index referred to by a motion vector used for coding the block coded in the 
direct mode may be assumed to be contained in a block coded in the direct mode and located in the neighbor of a 
current block to be coded. 

[01 50] Detailed explanation of method for determining a motion vector will be given below using concrete examples 
of a block. Fig. 35 is an illustration showing types of motion vectors contained in each block referred to by a current 
block BL1 to be coded. In Fig. 35A, a block containing a pixel A is a block intra picture coded, and a block containing 
a pixel B includes one motion vector and motion compensation is performed for the block using one motion vector, and 
a block containing a pixel C is a block including two motion vectors and motion compensation is performed bi-direc- 
tionally. The block containing the pixel B contains a motion vector indicated by the second reference index Ridxl . Since 
the block containing the pixel A is a block to be intra picture coded, it does not contain a motion vector. In other words, 
it does not contain a reference index, too. 

[0151] In the step S3701 , the number of blocks containing the first reference index RidxO is calculated. As shown in 
Fig. 35, since the number of blocks containing the first reference index RidxO is 2, assuming that a block which does 
not contain the first reference index RidxO contains a motion vector of MV = 0 of the first reference index RidxO, a 
motion vector corresponding to a median value of three motion vectors is used in a step S3706. Bi-directional motion 
compensation may be performed for a current block to be coded using only the above motion vector, or may be per- 
formed using the second reference index Ridxl and other motion vector as shown below. 

[0152] In the step S371 1 , the number of blocks containing the second reference index Ridxl is calculated. As shown 
in Fig. 35, since the number of blocks containing the second reference index Ridxl is 1 , a motion vector containing 
the second reference index Ridxl is used in the step S3715. 

[0153] In addition; an example of another calculation method combining the above-mentioned calculation methods 
is explained. Fig. 38 is an illustration showing a procedure for determining a motion vector of a current block to be 
coded using reference index values showing a picture referred to by a motion vector contained in blocks including 
pixels A, B and C respectively. Figs. 36A and 36B are illustrations showing a procedure for determining a motion vector 
based on the first reference index RidxO, and Figs. 36C and 36D are illustrations showing a procedure for determining 
a motion vector based on the second reference index Ridxl . Fig. 36 A shows a procedure based on the first reference 
index RidxO, on the other hand, Fig. 36C shows a procedure based on the second reference index Ridxl . Fig. 36B 
shows a procedure based on the first reference index RidxO, on the other hand, Fig. 36D shows a procedure based 
on the second reference index Ridxl . Hence, only Figs. 36A and 36B is explained below. To begin with, Fig. 36A is 
explained. 

[01 54] In a step S3801 , it is judged if the smallest first reference index RidxO of effective first reference indices RidxOs 
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can be selected. 

[0155] When it is possible to select the smallest first reference index RidxO from the effective first reference indices 
RidxOs, the motion vector selected in the step S3802 is used. 

[0156] When the effective first reference indices RidxOs include the plural smallest reference indices RidxOs in the 
5 step S3801, a motion vector contained in a block selected by priority in a step S3803 is used. Here, the priority, for 
example, determines a motion vector to be used for motion compensation of a current block to be coded in alphabetical 
order of pixels contained in blocks. 

[0157] When there is no effective first reference index RidxO in the step S3801 , process that is different from the 
steps S3802 and S3803 is used in a step S3804. For example, a process after a step S3711 explained in Fig. 374B 
10 may be used. Next, Fig. 36B is explained. The different point between Figs. 36A and 36B is that a process in the steps 
S3803 and S3804 in Fig. 36A is changed to a step S3813 in Fig. 36B. 

[01 58] in a step S381 1 , it is judged if the smallest first reference index RidxO of effective first reference indices RidxOs 
can be selected. 

[0159] When it is possible to select the smallest first reference index RidxO from the effective first reference indices 

15 RidxOs, the motion vector selected in the step S3B12 is used. 

[01 60] When there is no effective first reference index RidxO in the step S381 1 , process that is different from S381 2 
is used in the step S3813. For example, a process after a step S3711 explained in Fig. 374B may be used. 
[0161] Note that the above-mentioned effective first reference index RidxO is indicated by "O" in Fig. 35B, and is a 
reference index showing to have a motion vector. In Fig. 35B, the places in which V is written indicates that reference 

20 indices are not assigned. In a step S3824 in Fig. 36C and in a step S3833 in Fig. 36D, a process after the step S3701 
explained in Fig. 34A may be used. 

[0162] Detailed explanation of method for determining a motion vector will be given below using concrete examples 
of a block and Fig. 35. 

[01 63] In a step S3801 , it is judged if the smallest first reference index RidxO of effective first reference indices RidxOs 
25 can be selected. 

[01 64] In the case shown in Fig. 35, there are two effective first reference indices RidxO, however, when it is possible 
to select one smallest first reference index RidxO from effective first reference indices RidxOs in the step S3801 , a 
motion vector selected in the step S3802 is used. 

[0165] When the effective first reference indices RidxOs include the plural smallest reference indices RidxOs in the 
30 step S3801 , a motion vector contained in a block selected by priority in a step S3803 is used. Here, the priority, for 
example, determines a motion vector to be used for motion compensation of a current block to be coded in alphabetical 
order of pixels contained in blocks. When blocks including pixels B and C respectively contain the same first reference 
index RidxO, the first reference index RidxO of the block including the pixel B is employed by the priority, and motion 
compensation is preformed for a current block BL1 to be coded using amotion vector corresponding to the first reference 
35 index RidxO of the block containing the pixel B. In this case, motion compensation may be performed for the current 
block BL1 to be coded bi-directionally using only the determined motion vector, or may be performed using the second 
reference index Ridxl and other motion vector as shown below. 

[0166] In a step S3821 , it is judged if the smallest second reference index Ridxl of effective second reference indices 
Ridxl s can be selected. 

^o [0167] In the case shown in Fig. 35, since the effective second reference index Ridxl is one, a motion vector corre- 
sponding to the second reference index Ridxl of a block containing the pixel C is used in a step S3822. 
[0168] Note that in the above explanation, as for the block which does not contain reference indices, assuming that 
the block contains a motion vector of "O", a median value of three motion vectors is selected, however, assuming that 
the block contains a motion vector of "0", an average value of three vectors may be selected, or an average value of 
motion vectors of blocks containing reference indices may be selected. 

[0169] Note that a motion vector used for motion compensation of a current block to be coded may be determined 
by priority different from the one explained above, for example, in order of pixels B- A - C, which are contained in blocks. 
[0170] Thus, through determining a motion vector used for motion compensation of a current block to be coded by 
using a reference index, a motion vector can be determined uniquely. Moreover, according to the above-mentioned 
50 example, coding efficiency can be improved. In addition, since it is not necessary to judge whether a motion vector is 
a forward reference or a backward reference using time information, it is possible to simplify a process for determining 
a motion vector. When concerning a prediction mode for every block and a motion vector used for motion compensation 
or the like, there are a lot of patterns, however, as mentioned above since a process is done by a series of flows, it is 
useful. 

55 [01 71] Note that in this embodiment, the case calculating a motion vector used in the direct mode by scaling a motion 
vector referred to using a time interval between pictures is explained, however, a calculation may be performed by 
multiplying by a constant number. Here, a constant used for the multiplication may be variable when coding or decoding 
is performed on plural blocks basis or on plural pictures basis. Note that a calculation method using reference indices 
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RidxO and Ridxl are not only a method using a median value, and calculation methods may be combined with other 
calculation methods. For example, in the above-mentioned third calculation method, when motion vectors contained 
in blocks having pixels A, B and C respectively referring to a same picture of which reference index is the smallest are 
plural, it is not necessary to calculate a median value of these motion vectors, and a motion vector obtained from 
averaging these motion vectors may be used as a motion vector of the block BL1 used in the direct mode. Or, for 
example, a motion vector of which coding efficiency is the highest may be selected from plural motion vectors with the 
smallest reference indices. 

[0172] Moreover, a forward motion vector and a backward motion vector of the block BL1 may be calculated inde- 
pendently or dependently. For example, a forward motion vector and a backward motion vector may be calculated from 
a same motion vector. 

[01 73] On the other hand, either a forward motion vector or a backward motion vector both obtained from the calcu- 
lation may be used as a motion vector of the block BL1 . 

(Eighth Embodiment) 

[0174] In this embodiment, a reference block MB in a reference picture contains a forward (the first) motion vector 
referring to a reference picture stored in the long term picture buffer as the first reference picture and a backward (the 
second) motion vector referring to a reference picture stored in the short term picture buffer as the second reference 
picture. 

[0175] Fig. 37 is an illustration showing a bi-prediction in the direct mode when only one reference picture is stored 
in the long term picture buffer. 

[0176] The different point of the eighth embodiment from above-mentioned plural embodiments is that a forward 
motion vector MV21 of a block MB2 in a reference picture refers to a reference picture stored in the long term picture 
buffer. 

[0177] The short term picture buffer is a buffer for storing reference pictures temporarily, and, for example, pictures 
are stored in order in which pictures are stored in a buffer (that is, coding/decoding order). When pictures are newly 
stored in the buffer there is not enough storage capacity, pictures are deleted from a picture stored most previously in 
the buffer 

[0178] In the long term picture buffer, pictures are not always stored in the long term picture buffer in time order as 
the short term picture buffer. For example, as an order of storing pictures, time order of pictures may be corresponded, 
and order of address in a buffer in which pictures are stored may be corresponded. Therefore, it is impossible to scale 
a motion vector MV21 referring to a picture stored in the long term picture buffer based on a time interval. 
[0179] A long term picture buffer is not for storing a reference picture temporarily as the short term picture buffer, 
but for storing a reference picture continuously. Therefore, a time interval corresponding to a motion vector stored in 
the long term picture buffer is much wider than a time interval corresponding to a motion vector stored in the short term 
picture buffer. 

[0180] In Fig. 37, a boundary between the long term picture buffer and the short term picture buffer is indicated by 
a dotted vertical line as shown in the figure, and information about pictures on the left side of the dotted vertical line is 
stored in the long term picture buffer, and information about pictures on the right side of the dotted vertical line is stored 
in short term picture buffer. Here, a block MB1 in a picture P23 is a current block. A block MB2 is a co-located reference 
block of the block MB1 in a picture P24. A forward motion vector MV21 of the block MB2 in the reference picture P24 
is the first motion vector referring to a picture P21 stored in the long term picture buffer as the first reference picture, 
and a backward motion vector MV25 of the block MB2 in the reference picture P24 is the second motion vector referring 
to a picture P25 stored in the short term picture buffer as the second reference picture. 

[0181] As mentioned above, a time interval TR21 between the pictures P21 and P24 is corresponded to a forward 
motion vector MV21 referring to a picture stored in the long term picture buffer, a time interval TR25 between the 
pictures P24 and P25 is corresponded to a backward motion vector MV25 referring to a picture stored in the short term 
picture buffer, and the time interval TR21 between the pictures P21 and P24 can become much wider than the time 
interval TR25 between the pictures P24 and P25 or can be undefined. 

[0182] Therefore, a motion vector of the block MB1 in the current picture P23 is not calculated by scaling a motion 
vector of the block MB2 in the reference picture P24 as previous embodiments, but the motion vector is calculated 
using a following method. 

MV21 = MV2V 



MV24' = 0 
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[0183] The upper equation shows that the first motion vector MV21 stored in the long term picture buffer is used 
directly as the first motion vector MV2V in a current picture. 

[0184] The lower equation shows that since the second motion vector MV24', which refers to the picture 24 stored 
in the short term picture buffer, of the block MB1 in the current picture for the picture P23 is enough smaller than the 

s first motion vector MV2V, MV24' is negligible. The second motion vector MV24* is treated as "0". 

[0185] As described above, a reference block MB contains one motion vector referring to a reference picture stored 
in the long term picture buffer as the first reference picture; and one motion vector referring to a reference picture 
stored in the short term picture buffer as the second reference picture. In this case, bi-prediction is performed using 
the motion vector stored in the long term picture buffer out of motion vectors of the block in the reference picture directly 

10 as a motion vector of a block in a current picture. 

[01 86] Note that a reference picture stored in the long term picture buffer may be either the first reference picture or 
the second reference picture, and a motion vector MV21 referring to a reference picture stored in the long term picture 
buffer may be a backward motion vector. Moreover, when the second reference picture is stored in the long term picture 
buffer and the first reference picture is stored in the short term picture buffer, a motion vector in a current picture is 

15 calculated by scaling a motion vector referring to the first reference picture. 

[01 87] This makes it possible to perform bi-prediction without using time which is considerably large in the long term 
picture buffer or undefined. 

[0188] Note that bi-prediction may be performed not directly using a motion vector referred to but using a motion 
vector by multiplying by a constant number. 
20 [0189] In addition, a constant used for the multiplication may be variable when coding or decoding is performed on 
plural blocks basts or on plural pictures basis. 

(Ninth Embodiment) 

25 [0190] In this embodiment, bi-prediction in the direct mode is shown. In this case, a reference block MB in a reference 
picture contains two forward motion vectors referring to a reference picture stored in a long term picture buffer. 
[0191] Fig. 38 is an illustration showing bi-prediction in the direct mode when a reference block MB in a reference 
picture contains two forward motion vectors referring to a reference picture stored in the long term picture buffer. 
[0192] The different point of the ninth embodiment from the eighth embodiment is that both motion vectors MV21 

30 and MV22 of a block MB2 in a reference picture refer to a picture stored in the long term picture buffer. 

[0193] In Fig. 38, a boundary between the long term picture buffer and the short term picture buffer is indicated by 
a dotted vertical line as shown in the figure, and information about pictures on the left side of the dotted vertical line is 
stored in the long term picture buffer and information about pictures on the right side of the dotted vertical line is stored 
in short term picture buffer. Motion vectors MV21 and MV22 of the block MB2 in a reference picture P24 both refer to 

35 a picture stored in the long term picture buffer. The motion vector MV21 corresponds to a reference picture P21 , and 
the motion vector MV22 corresponds to a reference picture P22. 

[01 94] A time interval TR22 between the pictures P22 and P24 can become much wider than the time interval TR25 
between the pictures P24 and P25 or can be undefined corresponding to the motion vector M V22 referring to the picture 
P22 stored in the long term picture buffer. 
40 [0195] In Fig. 3B, pictures are stored in order of pictures P22 - P21 in that order in the long term picture buffer. Here 
the picture P21 corresponds to a motion vector MV21 and the picture P22 corresponds to a motion vector MV22. In 
Fig. 38, a motion vector of a block MB1 in a current picture is calculated as follows. 

A - MV22' = MV22 



MV24' = 0 

50 [0196] The upper equation shows that a motion vector MV22 referring to a picture P22 to which the smallest order 
is assigned is used directly as a motion vector MV22' of the block MB1 in a current picture P23. 
[0197] The lower equation shows that since the backward motion vector MV24' of the block MB1 in the current picture 
P23 stored in the short term picture buffer is enough smaller than the motion vector MV21\ MV24' is negligible. The 
backward motion vector MV24' is treated as "O". 

55 [0198] As described above, by directly using a motion vector referring to a picture to which the smallest order is 
assigned out of motion vectors of a block in a reference picture stored in the long term picture buffer, bi-prediction can 
be made without using time which is considerably large in the long term picture buffer or undefined. 
[0199] Note that bi-prediction may be made not directly using a motion vector referred to but using a motion vector 
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by multiplying by a constant number. 

[0200] In addition, a constant used for the multiplication may be variable when coding or decoding is performed on 
plural blocks basis or on plural pictures basis. 

[0201] Furthermore, when motion vectors MV21 and MV22 of the block MB2 in a reference picture both refer to a 
5 picture stored in the long term picture buffer, a motion vector referring to the first reference picture may be selected. 
For example, when MV21 is a motion vector referring to the first reference picture and MV22 is a motion vector referring 
to the second reference picture, the motion vector MV21 referring to a picture P21 and a motion vector "0" referring to 
a picture P24 are used as motion vectors of a block MB1 . 

10 (Tenth Embodiment) 

[0202] In this embodiment, a calculation method of a motion vector in the direct mode shown in the fifth embodiment 
through the ninth embodiment is explained. This calculation method of a motion vector is applied to either of coding 
and decoding a picture. Here, a current block to be coded or decoded is called a current block MB. A co-located block 

15 of the current block MB in a reference picture is called a reference block. 

[0203] Fig. 39 is an illustration showing a process flow of a motion vector calculation method of this embodiment. 
[0204] First, it is judged if a reference block MB in a backward reference picture referred by a current block MB 
contains a motion vector (step S1). If the reference block MB does not contain a motion vector (No in step S1), bi- 
prediction is performed assuming that a motion vector is "0" (step S2) and a process for a motion vector calculation is 

20 completed. 

[0205] If the reference block MB contains a motion vector (Yes in step S 1 ) , it is judged if the reference block contains 
a forward motion vector (step S3). 

[0206] If the reference block does not contain a forward motion vector (No in step S3), since the reference block MB 
contains only a backward motion vector, the number of backward motion vectors is judged (step S14). When the number 
25 of backward motion vectors of the reference block MB is "2", bi-prediction is performed using two backward motion 
vectors scaled based on one of the calculation method mentioned in Figs. 17, 18, 19 and 20. 

[0207] On the other hand, when the number of backward motion vectors of the reference block MB is "1 ", the only 
backward motion vector contained in the reference block MB is scaled and motion compensation is performed using 
the scaled backward motion vector (step S1 6). After completing the bi-prediction in the step S15 or S16, a process of 
30 the motion vector calculation method is completed. 

[0208] On the other hand, if the reference block MB contains a forward motion vector (Yes in step S3), the number 
of forward motion vectors of the reference block MB is judged (step S4). 

[0209] When the number of forward motion vectors of the reference block MB is "1 ", it is judged if a reference picture 
corresponding to the forward motion vector of the reference block MB is stored in the long term picture buffer or the 
35 short term picture buffer (step S5). 

[0210] When the reference picture corresponding to the forward motion vector of the reference block MB is stored 
in the short term picture buffer, the forward motion vector of the reference block MB is scaled and bi-prediction is 
performed using the scaled forward motion vector (step S6). 

[0211] When the reference picture corresponding to the forward motion vector of the reference block MB is stored 
40 in the long term picture buffer, bi-prediction is performed based on the motion vector calculation method shown in Fig. 
37 assuming that a backward motion vector is 0 and using the forward motion vector of the reference block MB directly 
without scaling (step S7). After completing the bi-prediction in the step S6 or S7, a process of the motion vector cal- 
culation method is completed. 

[0212] When the number of forward motion vectors of the reference block MB is "2", the number of forward motion 
45 vectors corresponding to a reference picture stored in the long term picture buffer is judged (step S8). 

[0213] When the number of forward motion vectors corresponding to a reference picture stored in the long term 
picture buffer is "0 M in the step S8, a motion vector which is temporally close to a current picture containing the current 
block MB is scaled and bi-prediction is performed using the scaled forward motion vector based on the motion vector 
calculation method shown in Fig. 14 (step S9). 
so [0214] When the number of forward motion vectors corresponding to a reference picture stored in the long term 
picture buffer is a 1" in the step S8, a motion vector in a picture stored in the short term picture buffer is scaled and bi- 
prediction is performed using the scaled motion vector (step S10). 

[0215] When the number of forward motion vectors corresponding to a reference picture stored in the long term 
picture buffer is "2 a in the step S8, it is judged if a same picture in the long term picture buffer is referred to by both of 
55 two forward motion vectors (step S1 1 ). If the same picture in the long term picture buffer is referred to by both of two 
forward motion vectors (Yes in step S11 ), bi-prediction is performed using a motion vector previously coded or decoded 
in the picture referred to by two forward motion vectors in the long term picture buffer based on the motion vector 
calculation method shown in Fig. 13 (step S12). 
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[0216] If a same picture in the long term picture buffer is not referred to by both of two forward motion vectors (No 
in step S11 ), bi-prediction is performed using a forward motion vector corresponding to a picture to which a small order 
is assigned in the longlerm picture buffer (step S13). In the long term picture buffer, since reference pictures are stored 
regardless of actual time of pictures, a forward motion vector to be used for bi-prediction is selected in concordance 
s with an order assigned to each reference picture. There is a case that the order of reference pictures stored in the long 
term picture buffer coincides with time of pictures, however, it may be merely coincided with an address in the buffer. 
In other words, the order of pictures stored in the long term picture buffer does not necessarily coincide with time of 
pictures. After completing bi-prediction in steps S12 and S13, process of the motion vector calculation method is com- 
pleted. 

10 

(Eleventh Embodiment) 

[021 7] Detailed explanation of the eleventh embodiment according to the present invention will be given below using 
illustrations. 

is [0218] Fig. 40 is a block diagram showing a configuration of a moving picture coding apparatus 1100 according to 
the eleventh embodiment of the present invention. The moving picture coding apparatus 1100 is an apparatus which 
can code moving pictures by applying a spatial prediction method in the direct mode even if a block coded in a field 
structure and a block coded in a frame structure are mixed, and includes a frame memory 1 1 01 , a difference calculating 
unit 1102, a predictive difference coding unit 1103, a bit stream generating unit 1104, a predictive difference decoding 

20 unit 1105, an add operating unit 1106, a frame memory 1107, a motion vector detecting unit 1108, a mode selecting 
unit 1109, a coding control unit 110, a switch 1111 , a switch 1112, a switch 1113, a switch 1114, a switch 1115 and a 
motion vector storing unit 1116. 

[0219] The frame memory 1101 is a picture memory storing inputted pictures on picture basis. The difference cal- 
culating unit 1102 calculates prediction error, which is difference between an inputted picture from the frame memory 

25 1 1 01 and a reference picture obtained from a decoded picture based on a motion vector, and outputs it. The prediction 
difference coding unit 1103 performs frequency conversion for the prediction error obtained in the difference calculating 
unit 1102, quantizes and outputs it. The bit stream generating unit 1104 converts into a format of output coded bit 
stream after performing variable length coding of the coded result from the predictive difference coding unit 1 103, and 
generates a bit stream adding additional information such as header information in which related information on the 

30 coded prediction error is described. The predictive difference decoding unit 1 1 05 performs variable length coding and 
inverse quantization of the coded result from the predictive difference coding unit 1103, and after that performs inverse 
frequency conversion such as IDCT conversion after performing, and decodes the coded result to output predictive 
residual. The add operating unit 1 106 adds a predictive residual as a decoded result, to the above-mentioned reference 
picture, and outputs a reference picture showing a same picture as an inputted picture by coded and decoded picture 

35 data: The frame memory 11 07 is a picture memory storing reference pictures on picture basis. 

[0220] The motion vector detecting unit 11 08 derives a motion vector for every coding of a current frame to be coded. 
The mode selecting unit 1 1 09 selects if calculation of a motion vector is performed in the direct mode or in other mode. 
The coding control unit 1110 reorders inputted pictures stored in the frame memory 1101 in input order to coding order. 
Additionally, the coding control unit 1110 judges which of a field structure or a frame structure is used for coding for 

40 every predetermined size of a current frame to be coded. Here, the predetermined size is a size of two macroblocks 
(for example, 1 6 (horizontal) X 1 6 (vertical) pixels) combined vertically (hereinafter, macroblock pair). If a field structure 
is used for coding, a pixel value is read every other horizontal scanning line corresponding to interlace from the frame 
memory 1101, if a frame basis is used for coding, each pixel value in inputted picture is read sequentially from the 
frame memory 1101 and each read pixel value is placed on the memory in order to configure a current macroblock 

45 pair to be coded corresponding to the field structure or the frame structure. The motion vector storing unit 1 1 1 6 stores 
a motion vector of a coded macroblock and reference indices of frames referred to by the motion vector. Reference 
indices are stored for each macroblock of coded macro block pairs. 

[0221] Next, an operation of the moving picture coding apparatus configured as above is explained. Pictures to be 
inputted are inputted into the frame memory 1 101 on picture basis in time order. Fig. 41 A is an illustration showing an 

so order of frames inputted into the moving picture coding apparatus 100 on picture basis in time order. Fig. 41 B is an 
illustration showing an order of pictures reordering the order of pictures shown in Fig. 41 A to coding order. In Fig. 41 A, 
a vertical lines indicate pictures, and the number indicated on the lower right side each picture shows the picture types 
(I, P and B) with the first alphabet letters and the picture numbers in time order with following numbers. Fig. 42 is an 
illustration showing a structure of a reference frame list 300 to explain the eleventh embodiment. Each picture inputted 

55 into the frame memory 1101 is reordered in coding order by the coding control unit 1110. Pictures are reordered in 
coding order based on referential relation of an inter picture prediction coding, and in the coding order, a picture used 
as a reference picture is coded previously to a picture referring to a picture. 

[0222] For example, it is assumed that a P picture uses one of preceding and neighboring three I or P pictures as a 
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reference picture. On the other hand, it is assumed that a B picture uses one of preceding and neighboring three I or 
P pictures and one of following and neighboring I or P picture as a reference picture. Specifically, a picture P7 which 
is inputted after pictures B5 and B6 in Fig. 41 A is reordered and placed before pictures B5 and B6 since the picture 
P7 is referred to by pictures B5 and B6. Likewise, a picture P10 inputted after pictures B8 and B9 is reordered and 
5 placed before pictures B8 and B9, and a picture P13 inputted after pictures B11 and B12 is reordered and placed 
before pictures B11 and B12. Hence, the result of reordering an order of pictures shown in Fig. 41 A is as shown in Fig. 
41 B. 

[0223] It is assumed that each picture reordered by the frame memory 1101 is read on macroblock pair basis, and 
each macroblock pair is 1 6 (horizontal) x 1 6 (vertical) pixels in size. Here a macroblock pair combines two macroblocks 
10 vertically. Therefore, a macroblock pair is 16 (horizontal) X 32 (vertical) pixels in size. A process of coding a picture 
B11 is explained below. Note that in this embodiment it is assumed that the coding control unit 1 11 0 controls reference 
indices, that is, a reference frame list. 

[0224] Since the picture B1 1 is a B picture, inter picture prediction coding is performed using bi-directional reference. 
It is assumed that the picture B1 1 uses two of preceding pictures P1 0, P7, P4, and a following picture P1 3 as a reference 

15 picture. Additionally, it is assumed that selection of two pictures from these four pictures can be specified on macroblock 
basis. Here, it is assumed that reference indices are assigned using a method of initial condition. In other words, a 
reference frame list 300 during coding the picture B11 is as shown in Fig. 42. Regarding a reference picture in this 
case, the first reference picture is specified by the first reference index in Fig. 42 and the second reference picture is 
specified by the second reference index. 

20 [0225] In a process for the picture B11 , it is assumed that the coding control unit 1110 controls the switch 1113 to be 
on, the switches 1114 and 1 115 to be off. Therefore, a macroblock pair in the picture B1 1 read from the frame memory 
1101 is inputted into the motion vector detecting unit 1 1 08, the mode selecting unit 1 09 and difference calculating unit 
1102. In the motion vector detecting unit 108, by using decoded data of pictures P10, P7 and P4 stored in the frame 
memory 1 1 07 as a reference picture, the first motion vector and the second motion vector of each macroblock contained 

25 in a macroblock pair is derived. In the mode selecting unit 11 09, a coding mode for a macroblock pair is determined 
using motion vectors derived in the motion vector detecting unit 1108. Here, it is assumed that coding mode for a B 
picture may be selected from, for example, intra picture coding, inter picture prediction coding using uni-directional 
motion vector, inter picture prediction coding using bi-directional motion vector and a direct mode. When coding modes 
other than the direct mode are selected, it is determined which one of a frame structure or a field structure is used for 

30 coding a macroblock pair at the same time. 

[0226] Here, a motion vector calculation method using a spatial predicting method in the direct mode is explained. 
Fig. 43A is a flow chart showing an example of a motion vector calculation procedure using a spatial predicting method 
in the direct mode when a macroblock pair to be coded in a field structure and a macroblock pair to be coded in a frame 
structure are mixed. 

35 [0227] Fig. 43B is an illustration showing an example of a location of neighboring macroblock pairs to which the 
present invention is applied when a current macroblock pair to be coded is coded in a frame structure. Fig. 43C is an 
illustration showing an example of location of neighboring macroblock pairs to which the present invention is applied 
when a current macroblock pair to be coded is coded in a field structure. A macroblock pair diagonally shaded in Figs. 
43B and 43C is a current macroblock pair to be coded. 

40 [0228] When a current macroblock pair to be coded is coded using a spatial prediction in the direct mode, three 
coded macroblock pairs in the neighbor of the current macroblock pair to be coded are selected. In this case, the 
current macroblock pair may be coded in either of a field structure and a frame structure. Therefore, the coding control 
unit 1110 first determines which one of a field structure or a frame structure is used for coding a current macroblock 
pair to be coded. For example, when the number of neighboring macroblock pairs coded in a field structure is great, 

45 a current macroblock pair is coded in a field structure, and if the number of neighboring macroblock pairs coded in a 
frame structure is great, a current macroblock pair is coded in a field structure. Thus, by determining which one of a 
field structure or a frame structure is used for coding a current macroblock pair to be coded by using information on 
neighboring blocks, it is not necessary to describe information showing which structure is used for coding a current 
macroblock pair to be coded in a bit stream. Additionally, since a structure is predicted from neighboring macroblock 

50 pairs, it is possible to select an adequate structure. 

[0229] Next, the motion vector detecting unit 1108 calculates a motion vector of a current macroblock pair to be 
coded according to determination of the coding control 1110. First, the motion vector detecting unit 1108 checks which 
one of a field structure or a frame structure is determined to be used for coding by the control unit 110 (S301), and 
when the frame structure is determined to be used for coding, a motion vector of a current macroblock pair to be coded 

55 is derived using the frame structure (S302), and when the field structure is determined to be used for coding, a motion 
vector of the current macroblock pair to be coded is derived using the field structure (S303). 

[0230] Fig. 44 is an illustration showing a data configuration of a macroblock pair when coding is performed using a 
frame structure, and a data configuration of a macroblock pair when coding is performed using a field structure. In Fig. 
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44, a white circle indicates a pixel on odd-numbered horizontal scanning lines, and a black circle shaded with a hatch 
pattern of oblique lines indicates a pixel on even -numbered horizontal scanning lines. When a macroblock pair is cut 
from each frame showing an inputted picture, pixels on odd-numbered horizontal scanning lines and pixels on even- 
numbered horizontal scanning lines are placed alternately in a vertical direction as shown in Fig. 44. When the above- 
mentioned macroblock pair is coded in the frame structure, a process is performed every macroblock MB1 and every 
macroblock MB2 for the macroblock pair, and a motion vector is calculated for each of macroblocks MB1 and MB2 
forming a macroblock pair. When the macroblock pair is coded in the field structure, the macroblock pair is divided into 
a macroblocks TF and BF. Here, the macroblock TF indicates a top field and the macroblock BF indicates a bottom 
field when interlacing a macroblock pair in a horizontal scanning line direction, and the two fields form a macroblock 
pair. Then one motion vector is calculated for the two fields respectively. 

[0231] Based on such a macroblock pair, the case that a current macroblock pair to be coded is coded in a frame 
structure is explained as shown in Fig. 43B. Fig. 45 is a flow chart showing a detailed processing procedure in a step 
S302 shown in Fig. 43. Note that in Fig. 45 a macroblock pair is indicated as MBP, and a macroblock is indicated as MB. 
[0232] The mode selecting unit 1109 first calculates a motion vector of a macroblock MB1 (an upper macroblock), 
which is one of macroblocks forming a current macroblock pair to be coded, using a spatial prediction in the direct 
mode. First, the mode selecting unit 1 1 09 calculates the smallest value of indices in pictures referred to by neighboring 
macroblock pairs for the first and the second indices respectively (S501 ). In this case, however, when a neighboring 
macroblock pair is coded in the frame structure, the value is determined using only a macroblock adjacent to a current 
macroblock to be coded. Next, it is checked if neighboring macroblock pairs are coded in the field structure (S502), 
and if coding is performed using the field structure, it is further checked the number of fields to which the smallest index 
are assigned in fields referred to by two macroblocks forming the neighboring macroblock pairs from a reference frame 
list in Fig. 42 (S503). 

[0233] When the check result of the step S503 shows that the smallest index is assigned to either field referred to 
by the two macroblocks (that is, fields to which the same index is assigned), an average value of motion vectors of two 
macroblocks is calculated and made to be a motion vector of the neighboring macroblock pair. This is since, when 
considering based on a interlace structure, two macroblocks of neighboring macroblock pairs with the field structure 
are adjacent to a current macroblock to be coded with the frame structure. 

[0234] When the check result of the step S503 shows that the smallest index is assigned to only a field referred to 
by one macroblock; a motion vector of the macroblock is determined as a motion vector of the neighboring macroblock 
pair (S504A). When the smallest index is assigned to none of fields referred to, a motion vector of the neighboring 
macroblock pair is assumed to be "0" (S505). 

[0235] In above cases, from motion vectors of the neighboring macroblocks, by using only motion vectors referring 
to fields to which the smallest index is assigned, it is possible to select a motion vector with higher coding efficiency. 
A process in a step S505 shows that there is no adequate motion vector for prediction. 

[0236] When the check result of the step S502 shows the neighboring macroblock pairs are coded in the frame 
structure, among the neighboring macroblock pairs, a motion vector of a macroblock adjacent to a current macroblock 
to be coded is determined as a motion vector of the neighboring macroblock pair (S506). 

[0237] The mode selecting unit 1109 repeats processes from above steps S501 to S506 for selected three neigh- 
boring macroblock pairs. As a result, a motion vector is calculated for each of three neighboring macroblock pairs as 
for one macroblock of a current macroblock pair to be coded, for example, a macroblock MB1 . 
[0238] Next, the mode selecting unit 1 1 09 checks if the number of neighboring macroblock pairs referring to a frame 
with the smallest index or a field of the frame among three neighboring macroblock pairs is 1 (S507). 
[0239] In this case, the mode selecting unit 1 1 09 unifies reference indices of three neighboring macroblock pairs to 
a reference frame index or a reference field index, and compares them. In a reference frame list shown in Fig. 42, 
reference indices are merely assigned to every frame, however, since relation between the reference frame indices 
and reference field indices to which indices are assigned every field are constant, it is possible to convert one of 
reference frame list or a reference field list into another reference indices by calculation. 

[0240] Fig. 46 is an indicator chart showing a relation between reference field indices and reference frame indices. 
[0241] As shown in Fig. 46, there are several frames indicated by the first field f1 and the second field f2 in chrono- 
logical order in a reference field list, and reference frame indices such as 0, 1 , and 2 are assigned to each frame based 
on frames including a current block to be coded (frames shown in Fig. 46). In addition, reference field indices such as 
0, 1 and 2 are assigned to the first field f1 and the second field f2 of each frame based on the first field f 1 of a frame 
including a current block to be coded (when the first field is a current field to be coded). Note that the reference field 
indices are assigned from the first field f 1 and the second field f2 of a frame close to a current field to be coded. Here, 
if the current block to be coded is the first field f 1 , the indices are assigned giving priority to the first field f 1 , and if the 
current block to be coded is the second field f2, the indices are assigned giving priority to the second field f2. 
[0242] For example, when a neighboring macroblock coded in the frame structure refers to a frame with a reference 
frame index "1" and a neighboring macroblock coded in the field structure refers to the first field f1 with a reference 
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field index "2", the above-mentioned neighboring macroblocks are treated as they refer to a same picture, in other 
words, when a precondition that a reference frame index referred to by one neighboring macroblock is equal to half 
the reference field index (round down after decimal point) assigned to a reference field of another neighboring mac- 
roblock is satisfied, the neighboring macroblocks are treated as they refer to a same picture. 

[0243] For example, when a current block to be coded included in the first field f 1 indicated by A in Fig. 46 refers to 
the first field f1 with the reference field index M 2 M , and a neighboring macroblock with the frame structure refers to a 
frame with the reference frame index M 1 ", the above-mentioned neighboring blocks are treated as they refer to a same 
picture since the above-mentioned precondition is satisfied. On the other hand, when a neighboring macroblock refers 
to the first field with a reference field index "2" and other neighboring macroblock refers to a frame with a reference 
frame index "3", the neighboring blocks are treated as they do not refer to a same picture since the precondition is not 
satisfied. 

[0244] As mentioned above, if the check result of the step S507 shows the number is 1 , a motion vector of a neigh- 
boring macroblock pair referring to a field a frame with the smallest index or a field in the frame is determined as a 
motion vector of a current macroblock to be coded (S508). If the check result of the step S507 shows the number is 
not 1 , it is further checked if the number of neighboring macroblock pairs of three neighboring macroblock pairs referring 
to a frame with the smallest index or a field in the frame is 2 or more (S509). Then if the number is 2 or more, assuming 
that a motion vector of neighboring macroblock pairs not referring to a frame with the smallest index or a field in the 
frame is "0" (S510), a median value of three motion vectors of the neighboring macroblock pairs is determined as a 
motion vector of a current macroblock to be coded (S51 1 ). If the check result of the step S509 is less than 2, since the 
number of the neighboring macroblock pairs referring to the frame with the smallest index or the field in the frame is 
"0 M , a motion vector of a current macroblock to be coded is determined as "O" (S512). 

[0245] As a result of the above process, one motion vector MV1 can be obtained as a calculation result for a mac- 
roblock forming a current macroblock pair to be coded, for example, MB1 . The mode selecting unit 109 performs the 
above process for a motion vector with the second reference index, and performs motion compensation by bi-prediction 
using the two obtained motion vectors. However, when none of neighboring macroblocks contains the first orthe second 
motion vector, motion compensation is performed not using a motion vector in the direction indicated by a motion vector 
not contained in the neighboring macroblocks but using a motion vector in uni-direction. Moreover, the same process 
is repeated for the other macroblock in the current macroblock pair to be coded, for example, MB2. As a result, it is 
equal to perform motion compensation in the direct mode for each of two macroblocks in a current macroblock pair to 
be coded. 

[0246] Next, the case that a current macroblock pair to be coded is coded in the field structure as shown in Fig. 43C 
is explained. Fig. 47 is a flow chart showing a detailed processing procedure in a step S303 shown in Fig. 43. The 
mode selecting unit 1 1 09 calculates one motion vector MVt using a spatial prediction in the direct mode for a macroblock 
forming a current macroblock pair to be coded, for example, a macroblock TF corresponding to a top field of the 
macroblock pair. First, the mode selecting unit 1109 calculates the smallest value of indices in pictures referred to by 
neighboring macroblock pairs (S601). However, when the macroblock pairs are processed by the field structure, only 
a macroblock of a field (a top field or a bottom field) same as the current macroblock to be coded is considered. Next, 
it is checked if the neighboring macroblock pairs are coded by the frame structure (S602), and if coding is performed 
using the frame structure, it is further judged if frames referred to by two macroblocks in the neighboring macroblock 
pair are frames with the smallest index based on the index value assigned to each frame by a reference frame list 300 
(S603). 

[0247] If the check result of the step S603 shows that the smallest index is assigned to either of the frames referred 
to by the two macroblocks, an average value of motion vectors of the two macroblocks is calculated, and the calculation 
result is determined as a motion vector of the neighboring macroblock pair (S604). If the check result of the step S603 
shows that one or both of the frames referred to are not frames with the smallest index, it is further checked if a frame 
referred to by either of macroblocks contains the smallest index (S605). If the check result shows that the smallest 
index is assigned to a frame referred to by one of macroblocks, a motion vector of the macroblock is determined as a 
motion vector of the neighboring macroblock pair (S606). On the other hand, if the check result of the step S605 shows 
that none of macroblocks refers to a frame with the smallest index, a motion vector of the neighboring macroblock pair 
is determined as "0" (S607). In above cases, from motion vectors of the neighboring macroblocks, by using only motion 
vectors referring to frames to which the smallest index is assigned, it is possible to select a motion vector with higher 
coding efficiency. A process in a step S607 shows that there is no adequate motion vector for prediction. 
[0248] When the check result of the step S602 shows the neighboring macroblock pairs are coded in the field struc- 
ture, in the neighboring macroblock pair, motion vectors of the whole neighboring macroblock pair is determined as a 
motion vector of the macroblock pair corresponding to a current macroblock in a current macroblock pair to be coded 
(S608). The mode selecting unit 1 09 repeats processes from above steps S601 to S608 for selected three neighboring 
macroblock pairs. As a result, it is equal to obtain a motion vector for three neighboring macroblock pairs respectively 
as for a macroblock of the current macroblock pair to be coded, for example, a macroblock TF. 
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[0249] Next, the motion vector detecting unit 1 08 checks if the number of neighboring macroblock pairs referring to 
a frame with the smallest index among three neighboring macroblock pairs is 1 (S609). If it is 1 , a motion vector of a 
neighboring macroblock pair referring to a frame with the smallest index is determined as a motion vector of the current 
macroblock to be coded (S610). If the check result of the step S609 shows the number is not 1 , it is further checked 
if the number of neighboring macroblock pairs referring to a frame with the smallest index among three neighboring 
macroblock pairs is two or more (S61 1 ). Then if the number is two or more, assuming that a motion vector of neighboring 
macroblock pairs not referring to a frame with the smallest index is "O" (S612), a median value of three motion vectors 
of neighboring macroblock pairs is determined as a motion vector of the current macroblock to be coded (S613). If the 
check result of the step S611 is less than 2, since the number of neighboring macroblock pairs referring to a frame 
with the smallest index is "0", a motion vector of the current macroblock to be coded is determined as "0 n (S614). 
[0250] As a result of the above process, a motion vector MVt can be obtained as a calculation result for a macroblock 
forming a current macroblock pair to be coded, for example, a macroblock TF corresponding to a top field. The mode 
selecting unit 1 09 repeats the above process also for the second motion vector (corresponding to the second reference 
index). As for a macroblock TF, two motion vectors can be obtained by above process, and motion compensation is 
performed using the two motion vectors. However, when none of neighboring macroblocks contains the first or the 
second motion vector, motion compensation is performed not using a motion vector in the direction indicated by a 
motion vector not contained in the neighboring macroblocks but using a motion vector in uni-direction. This is because 
when a neighboring macroblock pair refers to only uni-direction lly, it is conceivable that coding efficiency becomes 
higher when a neighboring macroblock pair also refers to only uni-direction. 

[0251] Moreover, the same process is repeated for another macroblock in the current macroblock pair to be coded, 
for example, BF corresponding to a bottom field. As a result, it is equal to perform motion compensation in the direct 
mode for each of two macroblocks in the current macroblock pair to be coded, for example, the macroblocks TF and BF. 
[0252] Note that in the above cases, when a coding structure for a current macroblock pair to be coded and a coding 
structure for a neighboring macroblock pair are different, a calculation is performed by a process such as calculating 
an average value of motion vectors of two macroblocks in the neighboring macroblock pair, however, the present 
invention is not limited to the above cases. For example, only when a coding structure for a current macroblock pair 
to be coded and a neighboring macroblock pair are the same, a motion vector of the neighboring macroblock pair may 
be used, and when a coding structure for a current macroblock pair to be coded and a neighboring macroblock pair 
are different, a motion vector of the neighboring macroblock pair of which coding structure is different is not used. 
Specifically, 1 . When a current macroblock pair to be coded is coded in the frame structure, only a motion vector of a 
neighboring macroblock pair coded in the frame structure is used. In this case, when none of motion vectors of the 
neighboring macroblock pair coded in the frame structure refers to a frame with the smallest index, a motion vector of 
the current macroblock pair to be coded is determined as "0". When a neighboring macroblock pair is coded in the 
field structure, a motion vector of the neighboring macroblock pair is determined as "0". 

[0253] Next, 2. When a current macroblock pair to be coded is coded in the field structure, only a motion vector of 
a neighboring macroblock pair coded in a field structure is used. In this case, when none of motion vectors of the 
neighboring macroblock pair coded in the field structure refers to a frame with the smallest index, a motion vector of 
the current macroblock pair to be coded is determined as "0". When a neighboring macroblock pair is coded in the 
frame structure, a motion vector of the neighboring macroblock pair is determined as u 0". Thus, after calculating a 
motion vector of each neighboring macroblock pair, 3. When the number of motion vectors obtained by referring to a 
frame with the smallest index or a field in the frame among these motion vectors is only one, the motion vector is 
determined as a motion vector of a current macroblock pair in the direct mode, and if the number is not 1 , a median 
value of three motion vectors is determined as a motion vector of the current macroblock pair in the direct mode. 
[0254] Additionally, in the above cases, which one of a field structure or a frame structure is used for coding a current 
macroblock pair to be coded is determined based on the majority of a coding structure of coded neighboring macroblock 
pairs, however, the present invention is not limited to the above case. A coding structure may be fixed, for example, a 
frame structure is always used for coding in the direct mode, or a field structure is always used for coding in the direct 
mode. In this case, for example, when the field structure and the frame structure are switched for an every current 
frame to be coded, it may be described in a header of whole bit stream or in a frame header of every frame. Switching 
is performed, for example, on sequence basis, GOP basis, picture basis, and slice basis, and in this case, it may be 
described in a corresponding header of a bit stream or the like. Needless to say, even in the above cases, only when 
coding structures used for a current macroblock pair to be coded and a neighboring macroblock pair are the same, a 
motion vector of the current macroblock pair to be coded in the direct mode can be calculated by a method using a 
motion vector of the neighboring macroblock pair. In addition, when transmitting in such as packets, a header part and 
a data part may be separated and transmitted respectively. In this case, the header part and the data part are never 
included in one bit stream. However, as for packets, although a transmission order can be more or less out of sequence, 
a header part corresponding to a corresponding data part is just transmitted in other packet, and there is no difference 
even if it is not included in one bit stream. Thus, through fixing which one of the frame structure or the field structure 
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is used, a process for determining a coding structure by using information of neighbor becomes unnecessary, and a 
process can be simplified. 

[0255] In addition, in the direct mode, after processing a current macroblock pair using both the frame structure and 
the field structure, a coding structure with higher coding efficiency may be selected. In this case, it may be described 

s which one of the frame structure and the field structure is selected in a header part of a macroblock pair under coding. 
Needless to say, even in the above cases, only when coding structures used for a current macroblock pair to be coded 
and a neighboring macroblock pair are the same, a motion vector of the current macroblock pair to be coded in the 
direct mode can be calculated by a method using a motion vector of the neighboring macroblock pair. By using such 
a method, information showing which one of the frame structure or the field structure is used becomes necessary in a 

10 bit stream, however, it is possible to reduce residual data for motion compensation and coding efficiency can be im- 
proved. 

[0256] In the above explanation, the case that motion compensation is performed for neighboring macroblock pairs 
on macroblock size basis, however, motion compensation may be performed on a different size basis. In this case, as 
shown in Figs. 48A and 48B, a motion vector containing pixels located on a, b and c is used as a motion vector of 

15 neighboring macroblock pair for each macroblock of a current macroblock pair to be coded. Fig. 48A shows the case 
processing an upper macroblock, and Fig. 48B shows the case processing a lower macroblock. When structures (a 
frame structure/a field structure) for a current macroblock pair and neighboring macroblock pairs are different, a process 
is performed using a block including pixels located on a, b and c, and a block including pixels located on a', b' and c' 
as shown in Figs. 49A and 49B. Here, locations a', b' and c' are a block included in another macroblock in the same 

20 macroblock pair corresponding to locations of pixels a, b, and c. For example, in the case of Fig. 49A, when coding 
structures (a frame structure/a field structure) for a current macroblock pair and neighboring macroblock pairs are 
different, a motion vector of a block on the left side of an upper current macroblock to be coded is determined using 
motion vectors of BL1 and BL2. In the case of Fig. 49 B, when coding structures (a frame structure/a field structure) 
for a current macroblock pair and neighboring macroblock pairs are different, a motion vector of a block on the left side 

25 of an upper current macroblock to be coded is determined using motion vectors of BL3 and BL4. By using such a 
processing method, even if motion compensation is performed for neighboring macroblock on a size basis using a 
different size from a macroblock, a process in the direct mode can be performed in consideration of the difference of 
the frame structure and the field structure. 

[0257] Moreover, when motion compensation is performed for neighboring macroblock on a size basis different from 
30 a macroblock, by calculating an average value of motion vectors of a block included in the macroblock, the calculated 
value may be a motion vector of the macroblock. Even if motion compensation is performed for neighboring macroblocks 
on a size basis using a different size from a macroblock, a process in the direct mode can be performed in consideration 
of the difference of a frame structure and a field structure. 

[0258] By the way, as mentioned above, a motion vector is derived, and inter picture prediction coding is performed 
35 based on the derived motion vector. As a result, the motion vector derived in the motion vector detecting unit 1 08 and 
the coded predictive error picture are stored in a bit stream on macroblock basis. However, as for a motion vector of 
a macroblock coded in the direct mode, it is merely described that it is coded in the direct mode, and the motion vector 
and reference indices are not described in a bit stream. Fig. 50 is an illustration showing an example of a data config- 
uration of a bit stream 700 generated by a bit stream generating unit 104. As shown Fig. 50, in the bit stream 700 
40 generated by the bit stream generating unit 104, a Header is provided for every Picture. The Header contains, for 
example, items such as an item RPSL showing the change of a reference frame list 1 0, and an item showing a picture 
type of the picture and not shown in this figure, and when an assignment method of the first reference index 12 and 
the second reference index 13 in the frame list 10 is changed form initial settings, an assignment method after change 
is described in the item RPSL. 

45 [0259] On the other hand, coded predictive error is recorded on macroblock basis. For example, when a macroblock 
is coded using a spatial prediction in the direct mode, a motion vector of the macroblock is not described in an item 
Blockl , and information showing a coding mode is the direct mode is described in an item PredType. Here, the item 
Blockl is an item in which a predictive error corresponding to the macroblock is described, and the item PredType 
shows a coding mode of the macroblock. When it is selected which one of the frame structure or the field structure is 

50 used for coding from the viewpoint of the above-mentioned coding efficiency, information showing the choice between 
the frame structure and the field structure is described. Subsequently, coded predictive error is described in an item 
CodedRes. When another macroblock is a macroblock coded in an inter picture prediction coding mode, it is described 
in the item PredType in an item Block2 that a coding mode for the macroblock is the inter prediction coding mode. 
Here, the item CodeRes shows a coding mode described and the item PredType is an item in which a predictive error 

55 corresponding to the macroblock is described. In this case, the first reference index 12 of the macroblock is further 
described in an item RidxO, and the second reference index 13 is further described in an item Ridxl other than the 
coding mode. Reference indices in a block are represented by variable length code words, and the shorter code length 
is assigned to the smaller value. Subsequently, a motion vector of the macroblock during forward frame reference is 
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described in an Item MVO, and a motion vector during backward frame reference is described in an item MV1 . Then 
coded predictive error is described in the item CodeRes. 

[0260] Fig. 51 is a block diagram showing a configuration of a moving picture decoding apparatus 800 which decodes 
the bit stream 700 shown in Fig. 50. The moving picture decoding apparatus 800 is a moving picture decoding apparatus 

s which decodes the bit stream 700 in which a predictive error including a macroblock coded in the direct mode is de- 
scribed, and includes a bit stream analyzing unit 701 , a predictive difference decoding unit 702, a mode decoding unit 
703, a motion compensation decoding unit 705, a motion vector storing unit 706, a frame memory 707, an add operating 
unit 708, switches 709 and 710, and a motion vector decoding unit 711. The bit stream analyzing unit 701 extracts 
various data from inputted bit stream 700. Here, various data includes information such as information on a coding 

10 mode and information on a motion vector or the like. Extracted information on a coding mode is outputted to the mode 
decoding unit 703. On the other hand, extracted information on a motion vector is outputted to the motion vector 
decoding unit 705. Furthermore, extracted predictive difference coding data is outputted to the predictive difference 
decoding unit 702. The predictive difference decoding unit 702 decodes inputted predictive difference coding data and 
generates a predictive difference picture. Generated predictive difference picture is outputted to the switch 709. For 

15 example, when the switch 709 is connected to a terminal b, a predictive difference picture is outputted to the add 
operating unit 708. 

[0261] The mode decoding unit 703 controls the switches 709 and 710 referring to a coding mode information ex- 
tracted form a bit stream. When a coding mode is intra picture coding mode, the mode decoding unit 703 controls to 
connect the switch 709 with a terminal a and to connect the switch 71 0 with a terminal c. Moreover, the mode decoding 

20 unit 703 outputs a coding mode information to the motion compensation decoding unit 705 and the motion vector 
decoding unit 711 . The motion vector decoding unit 711 decodes a coded motion vector inputted from the bit stream 
analyzing unit 701. Decoded reference picture number and a decoded motion vector are stored in the motion vector 
storing unit 706 and outputted to the motion vector compensation decoding unit 705 at the same time. 
[0262] When a coding mode is the direct mode, the mode decoding unit 703 controls to connect the switch 709 with 

25 the terminal b and to connect the switch 71 0 with a terminal d. Moreover, the mode decoding unit 703 outputs a coding 
mode information to the motion compensation decoding unit 705 and the motion vector decoding unit 711 . The motion 
vector decoding unit 71 1 determines a motion vector to be used in the direct mode using a motion vector of neighboring 
macroblock pair and a reference picture number stored in the motion vector storing unit 706, when a coding mode is 
the direct mode. Since the method for determining a motion vector is same as the contents explained for the operation 

30 of the mode selecting unit 109 shown in Fig. 40, the explanation will be omitted here. 

[0263] Based on decoded reference picture number and decoded motion vector, the motion compensation decoding 
unit 705 obtains a motion compensation picture on macroblock basis from the frame memory 707. The obtained motion 
compensation picture is outputted to the add operating unit 708. The frame memory 707 is a memory storing decoding 
pictures on frame basis. The add operating unit 708 adds inputted predictive difference picture to a motion compen- 

35 sation picture, and generates a decoded picture. The generated decoded picture is outputted to the frame memory 707. 
[0264] As mentioned above, according to this embodiment, even if a neighboring macroblock pair coded by a frame 
structure and a neighboring macroblock pair coded by a field structure are mixed in coded neighboring macroblock 
pairs corresponding to a current macroblock pair to be coded include in a spatial prediction method in the direct mode, 
a motion vector can be easily calculated. 

40 [0265] Note that in the above embodiment, the case that each picture is processed on macroblock pair (connecting 
two macroblocks vertically) basis using either a frame structure or a field structure is explained, however, a process 
may be performed by switching a frame structure and a field structure on a different basis, for example, on macroblock 
basis. 

[0266] Moreover, in above embodiment, the case that a macroblock in a B picture is processed in the direct mode, 
^5 however, a macroblock in a P picture can be processed likewise. When coding and decoding a P picture, each block 
performs motion compensation from only one picture, and a reference frame list is only one. Therefore, in order to 
perform the process same as this embodiment in a P picture, the process calculating two motion vectors of a current 
block to be coded/decoded (the first reference frame and the second reference frame) in this embodiment may be 
changed to a process calculating one motion vector. 
50 [0267] • In addition, in the above embodiment, the case that predictive generation of a motion vector used in the direct 
mode is performed using motion vectors of three neighboring macroblock pairs is explained, however, the number of 
neighboring macroblock pairs to be used may be different. For example, a case using only a motion vector of a neigh- 
boring macroblock pair located on the left side is conceivable. 

55 (Twelfth embodiment) 

[0268] Through storing a program to realize a configuration of the picture coding method and the picture decoding 
method mentioned in the above embodiment on a storage medium such as a flexible disk, it becomes possible to easily 
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perform the process mentioned in the above embodiment on an independent computer system. 
[0269] Fig. 52 is an illustration explaining a storage medium which stores a program to realize the picture coding 
method and the decoding method of the above first embodiment through eleventh embodiment on a computer system. 
[0270] Fig. 52B shows an external view of the flexible disk viewed from the front, a configuration of a cross section 

s and the flexible disk, and Fig. 52A shows an example of a physical format of a flexible disk as a body of storage medium. 
A flexible disk FD is contained in a case F, and plural tracks Tr are formed concentrically on the surface of the disk 
from outer to inner radius, and each track is divided into 1 6 sectors Se in angular direction. Therefore, as for the flexible 
disk storing the above-mentioned program, a picture coding method and a picture decoding method as the above 
program are stored in an allocated area on the above-mentioned flexible disk FD. 

10 [0271] Fig. 52C shows a configuration for recording and reading the above-mentioned program on and from a flexible 
disk FD. When above-mentioned program is stored on the flexible disk FD, the , picture coding method and a picture 
decoding method as above-mentioned program are written via a flexible disk drive from a computer system Cs. When 
the above coding or decoding method is constructed in the computer system by the program on the flexible disk, the 
program is read from the flexible disk and transferred to the computer system. 

is [0272] Note that in the above explanation, a flexible disk is used as a storage medium, however it is possible to 
perform likewise using an optical disk. Moreover, a storage medium is not limited to a flexible disk and media capable 
of storing a program such as a CD-ROM, a memory card and a ROM cassette can execute likewise, 
[0273] Applications of the picture coding method and the picture decoding method shown in the above embodiment 
and a system using the applications will be further explained. 

20 [0274] Fig. 53 is a block diagram showing an overall configuration of a content supply system ex100 for realizing 
content distribution service. The area for providing communication service is divided into cells of desired size, and cell 
sites ex107 ~ ex110 which are fixed wireless stations are placed in each cell. 

[0275] In this content supply system ex1 00, for example, the Internet ex1 01 is connected to devices such as a com- 
puter ex111, a PDA (Personal Digital Assistant) ex 11 2, a camera ex113, a cell phone ex114 and a cell phone with a 
25 camera ex1 1 5 via the Internet service provider ex1 02, a telephone network ex1 04 and cell sites ex1 07~ex1 1 0. 

[0276] However, the content supply system ex100 is not limited to the configuration as shown in Fig. 53, and may 
be connected to a combination of any of them. Also, each device may be connected directly to the telephone network 
ex1 04, not through the cell sites ex1 07~ex1 1 0. 

[0277] The camera ex11 3 is a device such as a digital video camera capable of shooting moving pictures. The cell 
30 phone may be a cell phone of a PDC (Personal Digital Communication) system, a CDMA (Code Division Multiple 
Access) system, a W-CDMA (Wideband-Code Division Multiple Access) system or a GSM (Global System for Mobile 
Communications) system, a PUS (Personal Handyphone system) or the like. 

[0278] A streaming server ex1 03 is connected to the camera ex1 1 3 via the cell site ex1 09 and the telephone network 
ex104, and live distribution or the like using the camera ex113 based on the coded data transmitted from the user 

35 becomes possible. Either the camera ex1 13 or the server for transmitting the data may code the data. Also, the moving 
picture data shot by a camera ex116 may be transmitted to the streaming server ex103 via the computer ex111 . The 
camera ex116 is a device such as a digital camera capable of shooting still and moving pictures. Either the camera 
ex1 1 6 or the computer ex1 1 1 may code the moving picture data. An LSI ex1 1 7 included in the computer ex1 1 1 or the 
camera ex11 6 performs coding processing. Software for coding and decoding pictures may be integrated into any type 

40 of storage medium (such as a CD-ROM, a flexible disk and a hard disk) that is a recording medium which is readable 
by the computer ex111 or the like. Furthermore, a cell phone with a camera ex115 may transmit the moving picture 
data. This moving picture data is the data coded by the LSI included in the cell phone ex115. 

[0279] The content supply system ex1 00 codes contents (such as a music live video) shot by users using the camera 
ex1 13, the camera ex1 1 6 or the like in the same manner as the above embodiment and transmits them to the streaming 
45 server ex103, while the streaming server ex103 makes stream distribution of the content data to the clients at their 
request. The clients include the computer ex111, the PDA ex112, the camera ex113, the cell phone ex114 and so on 
capable of decoding the above-mentioned coded data. In the content supply system ex100, the clients can thus receive 
and reproduce the coded data, and further can receive, decode and reproduce the data in real time so as to realize 
personal broadcasting. 

so [0280] When each device in this system performs coding and decoding, the picture coding apparatus or the picture 
decoding apparatus shown in the above embodiment may be used. A cell phone is explained as an example. 
[0281] Fig. 54 is an illustration showing the cell phone ex1 1 5 using the picture coding method and the picture decoding 
method explained in the above embodiments. The cell phone ex115 has an antenna ex201 for communicating with 
the cell site ex110 via radio waves, a camera unit ex203 such as a CCD camera capable of shooting moving and still 

55 pictures, a display unit ex202 such as a liquid crystal display for displaying the data obtained by decoding pictures and 
the like shot by the camera unit ex203 and received by the antenna ex201 , a body unit including a set of operation 
keys ex204, a voice output unit ex208 such as a speaker for outputting voices, a voice input unit 205 such as a micro- 
phone for inputting voices, a storage medium ex207 for storing coded or decoded data such as data of moving or still 
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pictures shot by the camera, data of received e-mails and data of moving or still pictures, and a slot unit ex206 for 
attaching the storage medium ex207 to the cell phone ex1 1 5. The storage medium ex207 stores in itself a flash memory 
element, a kind of EEPROM (Electrically Erasable and Programmable Read Only Memory) that is an electrically eras- 
able and rewritable nonvolatile memory, in a plastic case such as a SD card. 

[0282] Next, the cell phone ex115 is explained with reference to Fig. 55. In the cell phone ex1 15, a main control unit 
ex311 for overall controlling each unit of the body unit including the display unit ex202 and the operation keys ex204 
is connected to a power supply circuit unit ex310, an operation input control unit ex304, a picture coding unit ex312, 
a camera interface unit ex303, a LCD (Liquid Crystal Display) control unit ex302 t a picture decoding unit ex309, a 
multiplexing/separating unit ex308, a recording and reading unit ex307, a modem circuit unit ex306 and a voice process- 
ing unit ex305 to each other via a synchronous bus ex313. 

[0283] When a call-end key or a power key is turned ON by a user's operation, the power supply circuit unit ex31 0 
supplies respective components with power from a battery pack so as to activate the digital cell phone with a camera 
ex1 1 5 for making it into a ready state. 

[0284] In the cell phone ex115, the voice processing unit ex305 converts the voice signals received by the voice 
input unit ex205 in conversation mode into digital voice data under the control of the main control unit ex311 including 
a CPU, ROM and RAM, the modem circuit unit ex306 performs spread spectrum processing of the digital voice data, 
and the communication circuit unit ex301 performs digital-to-analog conversion and frequency transform of the data, 
so as to transmit it via the antenna ex201 . Also, in the cell phone ex1 1 5, the communication circuit unit ex301 amplifies 
the data received by the antenna ex201 in conversation mode and performs frequency transform and analog-to-digital 
conversion for the data, the modem circuit unit ex306 performs inverse spread spectrum processing of the data, and 
the voice processing unit ex305 converts it into analog voice data, so as to output it via the voice output unit 208. 
[0285] Furthermore, when transmitting an e-mail in data communication mode, the text data of the e-mail inputted 
by operating the operation keys ex204 on the body unit is sent out to the main control unit ex311 via the operation 
input control unit ex304. In the main control unit ex311 , after the modem circuit unit ex306 performs spread spectrum 
processing of the text data and the communication circuit unit ex301 performs digital-to-analog conversion and fre- 
quency transform for it, the data is transmitted to the cell site ex11 0 via the antenna ex201 . 

[0286] When picture data is transmitted in data communication mode, the picture data shot by the camera unit ex203 
is supplied to the picture coding unit ex312 via the camera interface unit ex303. When it is not transmitted, it is also 
possible to display the picture data shot by the camera unit ex203 directly on the display unit 202 via the camera 
interface unit ex303 and the LCD control unit ex302. 

[0287] The picture coding unit ex312, which includes the picture coding apparatus as explained in the present in- 
vention, compresses and codes the picture data supplied from the camera unit ex203 by the coding method used for 
the picture coding apparatus as shown in the above-mentioned embodiment so as to transform it into coded picture 
data, and sends it out to the multiplexing/separating unit ex308. At this time, the cell phone ex1 1 5 sends out the voices 
received by the voice input unit ex205 during shooting by the camera unit ex203 to the multiplexing/separating unit 
ex308 as digital voice data via the voice processing unit ex305. 

[0288] The multiplexing/separating unit ex308 multiplexes the coded picture data supplied from the picture coding 
unit ex312 and the voice data supplied from the voice processing unit ex305 by a predetermined method, the modem 
circuit unit ex306 performs spread spectrum processing of the multiplexed data obtained as a result of the multiplexing, 
and the communication circuit unit ex301 performs digital-to-analog conversion and frequency transform of the data 
for transmitting via the antenna ex201 . 

[0289] As for receiving data of a moving picture file which is linked to a Web page or the like in data communication 
mode, the modem circuit unit ex306 performs inverse spread spectrum processing of the data received from the cell 
site ex110 via the antenna ex201 , and sends out the multiplexed data obtained as a result of the processing to the 
multiplexing/separating unit ex308. 

[0290] In orderto decode the multiplexed data received via the antenna ex201 , the multiplexing/separating unit ex308 
separates the multiplexed data into a bit stream of picture data and a bit stream of voice data, and supplies the coded 
picture data to the picture decoding unit ex309 and the voice data to the voice processing unit ex305 respectively 
viathe synchronous bus ex313. 

[0291] Next, the picture decoding unit ex309, which includes the picture decoding apparatus as explained in the 
present invention, decodes the bit stream of picture data by the decoding method corresponding to the coding method 
as shown in the above-mentioned embodiment to generate reproduced moving picture data, and supplies this data to 
the display unit ex202 via the LCD control unit ex302, and thus moving picture data included in a moving picture file 
linked to a Web page, for instance, is displayed. At the same time, the voice processing unit ex305 converts the voice 
data into analog voice data, and supplies this data to the voice output unit ex208, and thus voice data included in a 
moving picture file linked to a Web page, for instance, is reproduced. 

[0292] The present invention is not limited to the above-mentioned system, and at least either the picture coding 
apparatus or the picture decoding apparatus in the above-mentioned embodiments can be incorporated into a digital 
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broadcasting system as shown in Fig. 56. Such satellite or terrestrial digital broadcasting has been in the news lately. 
More specifically, a bit stream of video information is transmitted from a broadcast station ex409 to or communicated 
with a broadcast satellite ex41 0 via radio waves. Upon receipt of it, the broadcast satellite ex41 0 transmits radio waves 
for broadcasting, a home-use antenna ex406 with a satellite broadcast reception function receives the radio waves, 

5 and a television (receiver) ex401 or a set top box (STB) ex407 decodes the bit stream for reproduction. The picture 
decoding apparatus as shown in the above-mentioned embodiment can be implemented in the reproducing apparatus 
ex403 for reading off and decoding the bit stream recorded on a storage medium ex402 that is a storage medium such 
as a CD and a DVD. In this case, the reproduced video signals are displayed on a monitor ex404. It is also conceived 
to implement the picture decoding apparatus in the set top box ex407 connected to a cable ex405 for a cable television 

10 or the antenna ex406 for satellite and/or terrestrial broadcasting so as to reproduce them on a monitor ex408 of the 
television ex401 . The picture decoding apparatus may be incorporated into the television, not in the set top box. Or, a 
car ex412 having an antenna ex411 can receive signals from the satellite ex410 or the cell site ex107 for reproducing 
moving pictures on a display apparatus such as a car navigation system ex413. 

[0293] Furthermore, the picture coding apparatus as shown in the above-mentioned embodiment can code picture 
15 signals for recording on a storage medium. As a concrete example, there is a recorder ex420 such as a DVD recorder 
for recording picture signals on a DVD disk ex421 and a disk recorder for recording them on a hard disk. They can be 
recorded on a SD card ex422. If the recorder ex420 includes the picture decoding apparatus as shown in the above- 
mentioned embodiment, the picture signals recorded on the DVD disk ex421 or the SD card ex422 can be reproduced 
for display on the monitor ex408. 
20 [0294] As the structure of the car navigation system ex41 3, the structure without the camera unit ex203, the camera 
interface unit ex303 and the picture coding unit ex312, out of the components shown in Fig. 55, is conceivable. The 
same goes for the computer ex111, the television (receiver) ex401 and others. 

[0295] In addition, three types of implementations can be conceived for a terminal such as the above-mentioned cell 
phone ex114; a sending/receiving terminal implemented with both an encoder and a decoder, a sending terminal im- 
25 plemented with an encoder only, and a receiving terminal implemented with a decoder only. 

[0296] As described above, it is possible to use the moving picture coding method or the moving picture decoding 
method in the above-mentioned embodiments in any of the above-mentioned device and system, and using this meth- 
od, the effects described in the above embodiments can be obtained. 

[0297] From the invention thus described, it will be obvious that the embodiments of the invention may be varied in 
30 many ways. Such variations are not to be regarded as a departure from the spirit and scope of the invention, and all 
such modifications as would be obvious to one skilled in the art are intended for inclusion within the scope of the 
following claims. 

Industrial Applicability 

35 

[0298] The picture coding apparatus according to the present invention is useful as a picture coding apparatus em- 
bedded in a personal computer with communication facility, a PDA, a broadcast station for digital broadcasting and a 
cell phone or the like. 

[0299] The picture decoding apparatus according to the present invention is useful as a picture decoding apparatus 
40 equipped for a personal computer with communication facility, a PDA, a STB for receiving digital broadcasting and a 
cell phone or the like. 



Claims 

45 

1. A motion vector calculation method for inter picture prediction with reference to a plurality of pictures, the motion 
vector calculation method comprising: 

a referring step of referring to a plurality of pictures preceding in display order or a plurality of pictures following 
50 in display order or a plurality of pictures both preceding and following in display order; and 

a motion compensating step of calculating a motion vector of a inter picture predictive block using at least one 
motion vector, which satisfies a predetermined condition, among motion vectors already calculated for a co- 
located block of which motion vector is referred to, when performing motion compensation of the inter picture 
predictive block with reference to a motion vector of the co- located block in a picture other than a picture to 
55 which the inter picture predictive block belongs. 

2. The motion vector calculation method according to Claim 1 , 

wherein in the reference step, each one of pictures selected from a first picture order and a second picture 
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order can be referred to, the first picture order being the order in which identifying numbers are assigned to pictures 
in ascending sequence giving priority to a picture precedent in display order, and the second picture order being 
the order in which identifying numbers are assigned to pictures in ascending sequence giving priority to a picture 
following in display order, 

s in the motion compensating step, a motion vector, which refers to a picture in the first picture order, of the 

block of which motion vector is referred to is used. 

3. The motion vector calculation method according to Claim 2, 

wherein in the motion compensating step, the other picture is a picture having the smallest identifying number 
10 of the pictures in the second picture order, and a motion vector of the inter picture predictive block is calculated 

using the motion vector of the co-located block. 

4. The motion vector calculation method according to Claim 2, 

wherein in the motion compensating step, when the block of which motion vector is referred to contains a 
15 plurality of motion vectors referring to a picture stored in a long term picture buffer, a motion vector referring to a 

picture in the first picture order is used as a motion vector of the inter picture predictive block among motion vectors 
referring to the picture stored in the long term picture buffer. 

5. The motion vector calculation method according to Claim 1 , 

20 wherein in the motion compensating step, the motion vector of the inter picture predictive block is calculated 

using at least one of motion vectors referring to the picture preceding in display order. 

6. The motion vector calculation method according to Claim 1 , 

wherein in the motion compensating step, when the block of which motion vector is referred to contains one 
25 or more motion vectors, the motion vector of the inter picture predictive block is calculated using a motion vector 

coded or decoded previously, or a motion vector previously described in a bit stream. 

7. The motion vector calculation method according to Claim 1 , 

wherein in the motion compensating step, when the block of which motion vector is referred to contains a 
30 plurality of motion vectors referring to one of the pictures preceding or following in display order, a motion vector 

of the inter picture predictive block is calculated using at least one of the plurality of motion vectors. 

8. The motion vector calculation method according to Claim 1 , 

wherein in the motion compensating step, when the block of which motion vector is referred to contains a 
35 plurality of motion vectors referring to one of the pictures preceding and following in display order, a motion vector 

for performing motion compensation is calculated using one motion vector referring to a picture close to an inter 
picture predictive picture in display order or one motion vector referring to a picture far in display order among the 
plurality of motion vectors. 

40 9. The motion vector calculation method according to one of Claim 1 ~ Claim 8, 

wherein in the motion compensating step, when the block of which motion vector is referred to contains one 
motion vector stored in the long term picture buffer, as the motion vector of the inter picture predictive block, the 
motion vector stored in the long term picture buffer is assigned. 

45 10. The motion vector calculation method according to one of Claim 1 ~ Claim 8, 

wherein in the motion compensating step, when the block of which motion vector is referred to contains at 
least one motion vector referring to a picture stored in the long term picture buffer, and when the motion vector 
referring to the picture stored in the long term picture buffer is a motion vector which refers to a picture preceding 
a picture containing the block of which motion vector is referred to in display order, the motion vector is determined 

50 as the motion vector of the inter picture predictive block. 

11 . The motion vector calculation method according to one of Claim 1 — Claim 8, 

wherein in the motion compensating step, an operation rounding to a predetermined precision of a motion 
vector is further performed in process of calculating the motion vector of the inter picture predictive block or for 
55 the last result. 

1 2. A motion vector calculation method for the case that inter picture prediction is performed using a plurality of pictures, 
the motion vector calculation method comprising: 



37 



EP 1 411 729 



Page 38 of 97 



EP1 411 729 A1 

an assigning step of assigning one of a first reference index or a second reference index to the coded picture, 
the first reference index and the second reference index being used for selecting at least one of a first reference 
picture or a second reference picture which are referred to when obtaining a block in a current picture to be 
coded by motion compensation from a plurality of coded pictures stored in a storing unit; 
5 a first selecting step, when performing motion compensation of a block in the current picture to be coded, there 

are a plurality of motion vectors containing the first reference indices among motion vectors of blocks in the 
neighbor of the current picture to be coded, for selecting a motion vector indicating a median value of the 
motion vectors; and 

a detecting step of detecting a motion vector referring to a picture preceding the current picture to be coded 
to jn display order or a picture following the current picture to be coded in display order or pictures preceding 

and following the current picture to be coded in display order using the motion vector selected in the first 
selecting step. 

13. The motion vector calculation method according to Claim 12, 

15 wherein in the first selecting step, a motion vector indicating a median value of the smallest first reference 

indices is further selected from motion vectors containing the first reference indices. 

14. The motion vector calculation method according to Claim 13, 

wherein the first selecting step further comprises a second selecting step, when in the motion vectors of the 
20 blocks in the neighbor of the current picture to be coded there are a plurality of motion vectors containing the 

second reference indices, for selecting a motion vector indicating a median value of the motion vectors. 

15. The motion vector calculation method according to Claim 14, 

wherein in the second selecting step, a motion vector indicating a median value of the smallest second 
25 reference indices is further selected from the motion vectors containing the second reference indices. 

16. The motion vector calculation method according to Claim15, 

wherein in the detecting step, a motion vector selected in the first selecting step is assumed to be a motion 
vector of the case that the block in the current picture to be coded refers to a picture preceding the current picture 
30 to be coded in display order, and a motion vector selected in the second selecting step is assumed to be a motion 

vector of the case that the block in the current picture to be coded refers to a picture following the current picture 
to be coded in display order. 

17. The motion vector calculation method according to Claim 15, 

35 wherein in the detecting step, a motion vector selected in the first selecting step is assumed to be a motion 

vector of the case that the block in the current picture to be coded refers to the first reference picture, and a motion 
vector selected in the second selecting step is assumed to be a motion vector of the case that the block in the 
current picture to be coded refers to the second reference picture. 

40 18. The motion vector calculation method according to Claim 12, 

wherein when the neighbouring block is a block in the direct mode which performs coding using a motion 
vector of other block, a motion vector used virtually for coding or decoding the other block is assumed to be a 
motion vector of the block in the direct mode. 

45 19. The motion vector calculation method according to Claim 12 comprising instead of the first selecting step: 

a judging step, when motion compensation of the block in the current picture to be coded is performed, for 
determining a motion vector calculation method in concordance with the number of motion vectors containing 
the smallest first reference indices of motion vectors of the blocks in the neighbor of the blocks in the current 
50 picture to be coded, 

wherein in the detecting step, instead of the selected motion vector, a motion vector referring to a picture 
preceding the current picture to be coded in display order or a picture following the current picture to be coded in 
display order or pictures preceding and following the current picture to be coded in display order is derived by 
55 using a determined motion vector detecting method. 
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